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(54) Bacterial strain and genome of bifidobacterium 



(57) The present invention pertains to a novel mi- 
croorganism of the genus Bifidobacterium longum, in 
particular to its genomic sequence and the nucleotide 
sequences encoding polypeptides of Bifidobacterium 
NCC2705 (CNCM i-2618), which are secreted or spe- 
cific or which are involved in the metabolism, In the rep- 
lication process, and to polypeptides encoded by such 
sequences as well as to vectors including the said se- 
quences and cells or non-human animals transformed 
with these nucleotide sequences and vectors, respec- 



tively. The invention also relates to transcriptional gene 
products of the Bifidobacteruim genome and to methods 
of detecting these nucleic acids or polypeptides. The in- 
vention eventually comprises a data carrier comprising 
the nucleotide sequence and/or polypeptide sequence 
of NCC2705 and also pertains to food and pharmaceu- 
tical compositions containing said microorganism forthe 
prevention and/or treatment of diarrhea brought about 
by rotaviruses and pathogenic bacteria containg said Bi- 
fidobcterium. 



PATHOGENS SENSITIVITY TO HUMAN BIFIDOBACTERIA NCC 2705 
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Description 

[0001] The present invention pertains to a novel microorganism of the genus Bifidobacterium longum, in particular 
to its genomic and plasmid sequence and nucleotide sequences encoding polypeptides of said Bifidobacterium to 
vectors Including the said sequences and cells or non-human animals transformed with these nucleotide sequences 
and vectors, respectively. The invention also relates to transcriptional and translational products of the Bifidobacterium 
genome and to methods of detecting these nucleic acids or polypeptides, respectively. The invention also relates to a 
data earner comprising nucleotide sequences and/or polypeptide sequences of NCC2705. In addition the present 
invention pertains to the Bifidobacterium longum strain NCC2705 and also to food and pharmaceutical compositions 
containg said Bifidobcterium or active components thereof for the prevention and/or treatment of diarrhea brought 
about by rotaviruses and pathogenic bacteria: J 
[0002] Organisms that produce lactic acid as a major metabolic component have been known since decades. These 
bacteria are normally found in milk or in milk processing factories, respectively, living or decaying plants but also in the 
intestine of man and animals. These microorganisms, summarized under the term "lactic acid bacteria", represent a 
rather inhomogeneous group and comprise the genera Lactococcus, Lactobacillus, Streptococcus, Bifidobacterium 
Pediococcus etc.. 

[0003] Lactic acid bacteria have been utilized by mankind as fermenting agents for the preservation of food taking 
benefit of a low pH and the action of products generated during the fermentative activity thereof to inhibit the growth 
■ of spoilage bacteria. In addition, lactic acid bacteria have also been used for preparing a variety of different foodstuff 
20 such as cheese, yogurt and other fermented dairy products from milk. 

[0004] Quite recently lactic acid bacteria have attracted a great deal of attention in that some strains have been found 
to exhibit valuable properties to man and animals upon ingestion. In particular, specific strains of the genus Lactobacillus 
or Bifidobactenum have been found to pass the gastro-intestinal tract in a viable and live form without getting destroyed 
in the upper part thereof, especially by the impact of the low pH prevailing in the stomach and be able to colonize the 
& intestinal mucosa, with their temporary or sustained maintenance in the gut being presumed to bring about numerous 
positive effects on the health of the living beings. These strains are generically termed probiotics. 
[0005] EP 0 768 375 discloses such a specific strain of the genus Bifidobacterium, that is capable to become im- 
planted in the Intestinal flora and may adhere to intestinal cells. This Bifidobacterium, is reported to assist in immu- 
nomodulation, being capable to competitively exclude adhesion of pathogenic bacteria to Intestinal cells, thus support- 
30 ing the maintenance of the individual's health. ... 

[0006] In view of the valuable properties particular strains of lacticacid bacteria may provide, there is a desire in the 
art for addrtional lactic acid bacterial strains that are beneficial to the well being of man and/or animal. In addition a 
more detailed information is desired relating to the biology of these strains, in particular as regards the interaction with 
the hosts, the phenomena of passing different environmental conditions in the gut as well as having the capability to 
adhere to the intestine's mucosa and eventually the involvement in the enhancement of the immune system and defense 
against pathogens, which information will allow a better understanding of these mechanisms. 

[0007] Consequently, a problem of the present invention is to provide substantial data about bacterial strains that 
exhibit properties beneficial for man and/or animals: 

[0008] In view of said problem, a subject of the present invention is the nucleotide sequence having the sequence 
SEQ. ID. No. 1 of the lactic acid bacterium Bifidobacterium longum NCC2705 genome and/or the nucleotide sequence 
,D - No 1 099 of tne P'asmid contained tehrein. The invention is, however, not limited to the sequences indicated 
in SEQ. ID. No. 1 and SEQ ID. NO. 1 099, respectively, but encompasses genomes and nucleotides encoding polypep- 
tides of strain variants, polymorphisms, allelic variants, and mutants thereof . 
[0009] In the figures: 

Fig. 1 shows.a graph, indicating the capability of Bifidobacterium longum NCC 2705 to adhere to human intestinal 
cells in culture; as a comparison another Bifidobacterium, strain BL28 Ca1 was used; 

Fig. 2 shows the pathogen sensitivity of pathogenic bacteria towards Bifidobacterium longum NCC2705; 

Fig. 3shows the activity of the cell lines NCC2705 and B1 28 against S. typhimurium SL1 344 infecting Caco-2 cells; 

Fig. 4 shows the rate of survival of mice infected with Salmonella typhimurium SL 1 344 and treated with the Bifi- 
dobacterium NCC2705. 

Fig. 5 shows a scheme illustrating the cell culture screening for assessing rotaviral protective properties of the 
bacterial strain NCC 2705. • 
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[0010] The present invention is based on whole-genome sequencing^ the genome of the Bifidobacterium longum 
strain NCC 2705, that has been deposited with the Institute Pasteur according to the Budapest Treaty on January 29 tt1 , 
2001 receiving the deposit no. CNCM I- 1-2618. 

[0011] In a first aspect the present invention relates to nucleotide sequences selected from the group comprising (a) 
the nucleotide sequence of SEQ. ID. No. 1 ; (b) a nucleotide sequence exhibiting at least 90% Identity with the sequence 
of SEQ. ID. No. 1; or (c) a nucleotide sequence that is homologous or hybridizes to SEQ ID. No. 1 under stringent 
conditions, 

[0012] In another aspect the invention relates to nucleotide sequences selected from the group comprising (a) the 
nucleotide sequence of SEQ. ID. No. 1 099; (b) a nucleotide sequence exhibiting at least 90% identity with the sequence 
of SEQ. iD. No. 1099; or (c) a nucleotide sequence that is homologous or hybridizes to SEQ ID. No. 1099 under 
stringent conditions. 

[0013] The terms genome or genomjc sequence shall be understood to mean the sequence of the chromosome of 
Bifodobacterium longum. The term plasmid shall be understood to designate any extrachromosomal piece of DNA 
contained in the Bifidobacterium of the present invention. Nucleotide sequence, polynucleotide or nucleic acid are 
understoodto desigante a double-stranded DNA, a single-stranded DNA ortranscriptional products of of the said DNAs 
at various length including oligonuclotides of about 10 to 100 nucleotides in length. 

[0014] A homologous nucleotide sequence according to the present Invention is understood to mean a nucleotide 
sequence having a percentage identity with the bases of the nucleotide sequence SEQ. ID. No. 1 or SEQ. ID. No. 1099 
of at least 80%, preferably 90% and more preferably 95%, 96%, 97%, 98% or 99 %. The said homologous may com- 
prise, e.g., the sequences corresponding to the genomic sequence or to the sequences of its representative fragments 
of a bacterium belonging to the Bifidobacterium species, preferably the Bifodobacterium longum species, as well as 
the sequences corresponding to the genomic sequence or to the sequences of its representative fragments of a bac- 
terium belonging to the variants of the species Bifidobacterium. In the present invention, the terms species and genus 
are mutually interchangeable. 

[001 5] These homologous sequences may thus correspond to variations linked to mutations within the same species 
or between species and may correspond in particularto truncations, substitutions, deletions and/or additions of at least 
one nucleotide. The said homologous sequences may also correspond to variations linked to the degeneracy of the 
genetic code or to a bias in the genetic code which is specific to the family, to the species or to the variant and which 
are likely to be present in Bifidobacterium. 

[0016] Protein and/or nucleic acid sequence homologies may be evaluated using any of the variety of sequence 
comparison algorithms and programs known in the art. Such algorithms and programs include, but are by no means . 
limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW (see e.g. Pearson and Lipman, 1986, Proc. Natl. 
Acadi Scl. USA 85 (8): 2444-2448; Altschuletal., 1990, J. Mol.Blol.215 (3) : 403-410; Thompson etal., 1994, Nucleic 
Acids Res. 22 (2): 4673-4680; Higgins et al., 1 996, Methods Enzymol. 266: 383-402; Altschul et al., 1 990, J. Mol. Biol. 
215 (3): 403 - 410; Altschul et al., 1993, Nature Genetics 3: 266-272). 

[0017] In a particularly preferred embodiment, protein and nucleic acid sequence homologies are evaluated using 
the Basic Local Alignment Search Tool ("BLAST) which is well known in the art (supra). In particular, four specific 
BLAST programs have been used to perform the following task: 
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(1) BLASTP 

(2) BLASTN 

(3) BLASTX 

(4) TBLASTN 



Compares an amino acid query sequence against a protein sequence database 

Compares a nucleotide query sequence against a nucleotide sequence database 

Compares a nucleotide query sequence translated in all reading frames against a protein sequence 
database 

Compares a protein query sequence against a nucleotide sequence database dynamically 
translated in all reading frames 



[001 8] Among these representative fragments, those capable of hybridizing under stringent conditions with a nucle- 
otide sequence according to the invention are preferred. Hybridization under stringent conditions means that the tem- 
perature and ionic strength conditions are chosen such that they allow hybridization to be maintained between two 
complementary DNA fragments. Such conditions of high stringency may e.g. be achieved by carrying out the hybrid- 
isation at a preferred temperature of 65 °C in the presence of SSC buffer, e.g. 1 x SSC corresponding to 0.15 M NaCI 
and 0.05 M Na-citrate. The washing steps may be, for example, the following: 2 x SSC, 0.1 % SDS at room temperature 
followed by three washes with 1 x SSC, 0.1% SDS; 0.5 x SSC, 0.1% SDS; 0.1 x SSC, 0.1% SDS at 68 C for 15 minutes. 
[0019] The nucleotide sequence SEQ. ID. No. 1 and SEQ. ID. No 1099, respectively, has been obtained by sequenc- 
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ing the genome of and the plasmid contained in Bifidobacterium longurn NCC2705 by the method of directed sequenc- 
ing after fluorescent automated sequencing of the inserts of clones and assembling of these sequences of nucleotide 
fragments (inserts) by means of softwares. To this end, fragments of the genome were created, ligated into suitable 
vectors for amplification and propagation and the corresponding fragments were sequenced. Overlaps and the final 
arrangement of the fragments, the nucleotide sequence thereof, were assessed by the aid of approplate softwares. 
[0020] The present invention is further directed to nucleic acid molecules comprising open reading frames (ORFs) 
encoding Bifodobacterium longurn proteins. Therefore, according to another aspect the present Invention relates to a 
polynucleotide having a nucleotide sequence of an open reading frame (ORF) of a Bifidobacterium longurn genome 
comprising, (a) a nucleotide sequence chosen from any one of Seq ID. No. 2 to SEQ. ID. NO. 1 098; or (b) a nucleotide 
sequence exhibiting at least 95 %, preferably 96 %, 97 % 98 % and most preferably 99 % identity with any one of SEQ 
ID. NO. 2 to SEQ. ID. NO. 1098; or (c) a polynucleotide which hybridizes to any one of SEQ. ID. NO. 2 SEQ. ID NO 
1098 under conditions of high stringency. 

[0021] These nucleic acid molecules may be obtained, by e.g. specific amplification of the corresponding sequence 
using the polymerase chain reaction. Due to the sequence information provided herein the skilled person may design 
and synthesize any suitable primer nucleotide arid amplify a fragment of interest using the polymerase chain reaction. 
Therefore, the present invention also comprises nucleotide sequences selected from sequence SEQ. ID. NO. 1 which 
can be used as a primer for the amplification of nucleic acid sequences. Other techniques for amplifying the target 
nucleic acid may of course be also be used, such as e.g. the TAS (Transcription-based Amplification System) technique, 
the 3SR (Self-Sustained Sequence Replication) technique, the NASBA (Nucleic Acid Sequence Based Amplification) 
technique, the SDA (Strand Displacement Amplification) technique or the TMA (Transcription Mediated Amplification) 
technique etc.. ■ ■ ■ 

[0022] The (poly)nucleotides of the invention may also be used as probes and techniques for amplifying or modifying 
a nucleic acid serving as a probe, such as e.g the LCR (Ligase Chain Reaction) technique, the RCR (Repair Chain 
Reaction) technique, the CPR (Cycling Probe Reaction) technique orthe Q-beta-replicase amplification technique mav 

25 well be applied. V ' 

[0023] The present invention, therefore, envisages both hybridization (detection) probes and primers for detecting 
a nucleoide sequence (target nucleotide) of the present invention. In the case of the target being a RNA molecule, e. 
g. a rnRNA, said mRNA may be directly detected or transformed to a cDNA prior to detection: 
[0024] Alternatively, in order to obtain fragments of the nucleic acid represented by SEQ. ID. NO. 1 the Bifodobacte- 

30 riurn longurn genomic DNA may be subjected to digestion with selected restriction enzymes, with the fargments being 
separated by e.g. electrophoresis or another suitable separation technique. Such techniques are well known in the art 
and are inter alia disclosed in Sambrook et al. A Laboratory Manual, Cold Spring Harbor, 1992. Such fragments may 
easily be obtained by Isolating the genomic DNA of Bifodobacterium longurn NCC2705 (CNCM 1-2618) and performlnq 
the necessary steps. . 

35 [0025] In an alternative form the nucleic acids may also be obtained by chemical synthesis when they are not too 
large in size according to methods well known to a person skilled in the art. 

[0026] Modified nucleotide sequences shall be understood to mean any nucleotide sequence obtained by mutagen- 
esis according to techniques well known to a skilled person and exhibiting modifications In relation to the normal se- 
quences, for example mutations in the regulatory and/or promoter sequences for the expression of a polypeptide in 
40 particular leading to a modification of the level of expression of the said polypeptide or to a modulation of the replicative 
cycle. Modified nucleotide sequence will also be understood to mean any nucleotide sequence encoding a modified 
polypeptide as defined below. 

[0027] The subject of the present invention also includes Bifidobacterium longurn nucleotide sequences character- 
ized in that they are selected from a nucleotide sequence having an open reading frame as identified by SEQ ID NO 
45 2 to SEQ. ID. NO. 1098, shown in the table I below. 
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E-val 




o 


7.00E-71. 


7.00E-51 


. 1 ,00E-25 


i,00E-35 


o 


9.00E-95 


6,00E-88 


2.00E-60 


4,00E r 20' 


2.00E-44 


4.00E-68 


o 


. 1,00E-102- ; 


5.00E-72 


T 

UJ 

. o 
o 

eg 


5.00E-40 


3.00E-50 


6.00E-46 


5.00E-42 


1.00E-119 


5.00E49 


1 .00E-68 


2.00E-31 


10 
15 




Best Blast Hit ID 




TREMBLNEW.AF240579J 


PIR:H70705 


PIR:A70706 


PIR:G70816 


PIR:B70780 


PIR.T36384 


TREMBLN EW:AE00448 1 2 


TREMBLNEW:SC2G58_25 


PIR:T44251 


PIR:F75511 


PIR:B70506 '. 1 


PIRNEW:A82586 


PIR:C70724 


PIRT35147 


PIR:T35146 


PIR:T35145 


PIR:T35144 ■ . 


PIR:T2B698. 


PIR:D75557 


PIR:T17732 


TREMBLNEW:SC9C5_30 


PIR.H70592 


PIR:G70592 


PIR:S56453 
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25 
30 
35 


Table I 


Function . 




hsp60 heat shock protein 


phosphate response regulator protein phoP 


two-component sensor hlstidlne kinase 


cold shock protein 


Unknown . 


ATP-bindIng proteinase 


cytosine deaminase 


membrane transport protein 


Creatinase 


5'-nucleotidase 


Unknown 


histidyl-tRNA synthetase 


Aspartate-tRNA ligase 


glutamate uptake system ATP-binding protein -. 


glutamate-binding periplasmic protein 


glutamate transport protein gluC 


glutamate transport protein gluD 


acid phosphatase 


Unknown 


hellcase-like protein 


ATP/GTP binding protein 


sensory response regulator 


two-component sensory histidine kinase 


Periplasmic sugar-binding protein 


40 




Stop 




2455 


4447 


6469 


6926 


8124 


11960 


12146 


16670 


17514 


19479 


19691 


22845 


24480 


27288 


28159 


28836 


29942 


31572 


33576 


36277 


37739 


38519 


40213 


43239 


45 




Start 




in 

GO 


3674 


4622 


6542 


6940 


9354 


13462 


15297 


16759 . 


17689 


21148 


2124B 


22594 


26476 


27362 


28162 


28845 


30034 


32653 


33713 


36363 


37800 


38519 


42259 


50 




Code 




ORF3 


ORF7 


' ORF10 


ORF13 


ORF14 


. ORF23 


. ORF24 


ORF28 


ORF29 


ORF31 


ORF36 


; ORF38 


ORF42 


ORF47 


ORF48 


ORF49 


ORF50 


ORF51 


ORF57 


ORF60 


ORF64 


ORF65 


ORF67 


ORF74 


55 




Seq.lD 




OJ 

ci 
2 

d 

d 
ui 
co 

. 


SEQ.ID.No.3 


SEQ.ID.Np.4 


SEQ.ID.No.5 


SEQ. ID. No.6 


SEQ. ID. No.7 


. SEQ. ID. No.8 


. . SEQ.ID.No.9 • 


SEQ. ID. No.10 


. SEQ.ID.No.11 


SEQ. ID. No. 12 ■■ 


SEQ. ID. No.13 


SEQ. ID. No.14 


- SEQ. ID. No.15 


SEQ, ID. No.16 


SEQ. ID. No.17 


SEQ. ID. No. 18 


SEQ.ID.No.19 


SEQ. ID. No.20 


CM 
Q 

2 
Q 

d 

LU 
CO 


SEQ.ID.No.22 


SEQ.ID.No.23 


SEQ.ID.No.24 


SEQ.ID.No.25 
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E-val 


1/00E-1O0 


2.00E-44 


4.00E-30 


1.00E-54 


2.00E-30 


7.00E-56 


. 4.00E-53' 


4.00E-22 


3.00E-23 


1.00E-131 


2.00E-79 


1.00E-45 


LU 
O 

o. 

OJ 


3.00E-43 


3.00E-25 


2.00E-36 


5,00 E-20 


4.00E-67 


5.00E-21 


1.00E-120 


o 


6.00E-40 


6.00E-44 


4.0OE-22 ' 


3,00 E : 64 


10 
15 




Best Blast Hit ID 


& 

"3- 

CM 

co 
< 

LU 
2 
IX 
Cl 


PIR:S56456 


PIR:S56457 


TREMBLOOE250422_3 


PIRNEW:T50667 


PIR:D69159 


PIR:E75472 


PIR:T37220 


PIR:B75270 


PIR:T35810 


TREMBLNEW:SCP8_15 


TREMBL:SCE59_3 


PIR:A69252 . 


TREMBL:SCF41_25 


PIR:A69760 


PIR:S76205 


PIRNEW:F82198 


PIR:B7065B 


PIR:A70658 


PIR:D70959 


PIR:A70990 


PIR:B49930 


PIR:C70899 


PiR:C72360 


~i 

CO 

O 

s 

t 

m 
JJ 
m 

LU 

rr 
l— 


SO 

25 
30 
35 


Table i (continued) 


Function 


nbose ABC transporter, ATP-binding protein 


carbohydrate transport protein 


high affinity sugar transport protein rbsC 


Permease 


2-hydroxyacld dehydrogenase family 


methyl coenzyme M reductase system, component A2 


polyphosphate glucokinase 


aminotransferase 


ABC transporter, ATP-binding protein 


DNA ligase 


Unknovn (ATP-binding protein) 


■ ■■- 

sugar transport protein 


3-hydroxyacyl-CoA dehydrogenase 


Unknown 


Unknown 


Unknown 


ABC-type transport system ATP-binding protein 


translation elongation factor 


transcription termination protein 


carbamoyl-phosphate synthetase 


Carbamoyl-phosphate synthase large chain - 


orotidine-5'-phosphate decarboxylase 


guanylate kinase 


DNA polymerase III 


translation elongation factor EF-G homolog 


40 




Stop 


44921 


45993 


47012 


47241 


49977 


52521 


58835 


60319 


61152 


69948. 


71239 


73776 


77984 


79272 


81182 


81215 


82012 


83465 


84092 


85506 


88891 


89B11 


90581 


90589 


93692 - 


45 




Start 


43383 


44926 


45993 


48695 


49084 


54950 


58071 


59063 


60331 


67189 


70127 . 


72964 


77028 


78166 


80448 


B2012 


82797 


82902 


83523 


83803 


85511 


88894 


89994 


91293 


91440 


SO 




Code 


ORF75 


ORF76 


ORF78 


ORF79 


ORF83 


ORF90 


ORF99 


ORF101 


ORF103 


ORF113 


ORF114 


ORF119 


ORF129 


ORF131 


ORF137 


ORF139. 


ORF141 


ORF145 


ORF146 


ORF147 


ORF148 


ORF149 


ORF151 


ORF152 


ORF156 


55 




Seq.lD 


SEQ.ID.No.2S 


SEQ. ID. No.27 


SEQ.ID.No.28 


SEQ. ID, No.29 


SEQ. ID. No.30 


SEQ, ID. No.31 


SEQ. ID. No.32 


. SEQ. ID. No.33 


SEQ.ID.No.34 


SEQ. ID. No.35 


SEQ.ID.No.36 


SEQ. ID.Nb.37 


SEQ. ID. No.3B. 


SEQ. ID. No.39 


SEQ.ID.No.40 


SEQ. ID. No.41 


■ SEQ.ID.No.42 


SEQ. ID. No.43 


. SEQ.ID.No.44 


SEQ. ID. No.45 


SEQ.ID.No.46 


SEQ. ID. No.47 


SEQ.ID.No.48 


SEQ: ID. No.49. 


SEQ. ID. No.50 
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E-val 


4.00E-53 


5.00E-49 


7.00E-64 


1.00E-72 


56-300'B 


4.00E-78 • 


. 2.00E-46 


cn 

CO 

LU 
o 
o 

CO 


5.00E-30 


B.OOE-37 


3.00E-82 


5.00E-35 


1.00E42 


1.00E-24 


co 
o 

UJ 

o 
o 


1.00E-135 ; 


7.00E-47 


5.00E-77 


IO 
CM 

tii 
o 

O 


6.00E-26 


3.00E-91 


1 .00E-92 


3.00E-31 


5.00E-30 


3.00E-48 


10 

15 




Best Blast Hit 


PIR:D71849 


PIR:G81391 


PIR:F71849 


TREMBL:U60994_4 


CM 

co 

CM 

r— 

? 
5 

LU 
2 
CC 
o_ 


TREMBLREU010061J 


CO 

1 

to 

CM 

CO 

o 
o 
CJ> 
< 

tn 

LU 

rr 

r- 


PIR:C69978 


PIRT35113 


TREMBLNEW:SCPB_24 . 


CM 
I 

CO 
CM 
O 
CO 

LU 
2 
_1 

co 
2 

LU 
CC 
(- 


TREMBLNEW:SC23B6_23 


PIR:T36157 


PIR:T44716 


PIR:A70923 . 


PIR:T35407 


TREMBL:SCC121_13 


. PIR:H70585 - • 


PIR:C72369 


PIR:D72369 


CM 
CO 

CM 

d: 
a. 


PIRiGRECST 


CM 
CO 
CM 
CM 

r- 

Lu 

cr 

DL 


SWISSPROT:NEUS_MOUSE 


PIR;G64937 


20 
25 
30 
35 


Table I (continued) 


. Function 


amino acid ABC transporteri permease protein 


arnlno-acld ABC transporter integral membrane protein - 


amino acid ABC transporter, ATP-binding proteiri 


amino acid binding protein 


cystathionine beta-iyase 


transposase 


Unknown 


glutamate racemase 


Dlaminopimelate.epimerase .. 


Unknown 


ATP-dependent DNA helicase 


Unknown 


Unklovn 


Unknown 


1-deoxy-D-xylul08e 5-phosphate reductoisomerase 


peptidoglycan acetylation 


Unknown . 


undecaprenyl phosphate synthetase 


Unknown 


ABC transporter, permease protein 


sucrose hydrolase 


sucrose transport protein 


transcription regulator, LacI family 


Serpin 


membrane transport protein 


40 




Stop 


cn 
co 

CM 

cn 


9590.0 


96684 


97652 


99126. 


99774 


103459 


104506 


106295 


108069 


111878 


111954 


114795 


115327 


118686 


9066 U 


12250B 


123299 


124787 


126116 


127094 


. 128661 


130213 


131297 


133791 


45 




Start 


94496 


95229 
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Best Blast Hit ID 
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Table I (continued) 


Function 


phosphomethylpyrimidlne kinase 


thiamin biosynthesis protein thIA 


hydoxyethylthiazole kinase 


glycyl-tRNA synthetase 


transcription regulator 


ftsZ protein 


ribosomal large subunit pseudouridine synthase 


DNA-dlrected DNA polymerase III alpha chain 


protoporphyrinogen oxidase 


transposase. 


transposase 


ph age Infection protein 


phage infection protein 


NADPH-flavin oxidoreductase 


Unknown . . 


Sugar-binding protein 


Transcriptional regulators of the Lad family 


sugar transport system pemxase 


ABC transporter sugar permease 


arablnosldase 


transposase 


transposase 


probable transposase 


Unknown 


aromatic amino acid transport protein 
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Table 1 (continued) 


Function 


transcriptional regulator (TetR/AcrR family) 


ABC transporter, ATP-blndlng protein 


amino acid permease 


Unknown 


Unknown 


DNA-damage-inducible protein 


ABC transporter (ATP-binding protein) 


ABC transporter 


Unknown 


methionlne-tRNA ligase 


aminopeptldase 


Unknown 


lacl-family transcriptional regulator 


alpha galactosldase 


ABC transporter 


ABC transporter (ATP-binding protein) 


alpha-L-arablnofuranosidase I 


arabinan endo-1,5-alpha-L-arabinosidase homolog 


arabinan endo-1,5-alpha-L-arablnosidase 


transcrlptlon regulator, Lad family 


alpha-L-arabinofuranosidase II 


ABC transporter sugar permease 


membrane transport protein 


Unknown 


potassium uptake protein 
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Best Blast Hit ID 


PIR:T00092 


TREMBLAF033015 6 
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Table I (continued) 


Function 


Unknown 


glycosyltransferase homolog 


polysaccharide ABC transporter 


ABC-type transport protein 


glycerol-3-phosphate cytidylyltransferase 


Unknown 


Glycosyllransferase-type protein 


transferase 


Unknown 


Unknown 


. transposase 


probable transposase 


transposase . 


transposase 


glucose-1 -phosphate thymidylyltransferase 


dTDP-4-keto-6-deoxyglucose-3,5-epimerase 
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polysaccharide biosynthesis (cps-like) 


2-succinyl-6-hydroxy-24.-cyclohexadiene-1-caiboxylate 
synthase 


dehydrogenase-IIke 


glycosyltransferase 


putative rhamnosyl transferase 


glycosyl transferase 


transposase 
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. Best Blast Hit ID 
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Table! (continued) 


. Function. 


transposase. 


U ndecapreriyl-phosphate glucose 1 -phosphate transferase 


multidrug resistance protein 


transmembrane protein • 


Unknown- 


oxidoreductase 


arablnogalactanendo-1 ,4-beta-galactosldase 


transcription regulator, LacI family 


beta-galactosidase 


ABC transporter sugar permease 


ABC transporter sugar permease j 


solute-blnding lipoprotein 


ABC transporter 


long chain fatty acid coA ligase - 


ABC transporter, ATP-bindlng protein 


vexl(vexp1 ) homolog 


ABC transporter 


Unknown 


L-arablnose Isomerase 


L-rlbulose-phosphate 4-eplmerase 


xylulose kinase 


Transcriptional regulators of the LacI family 


ribonuclease HII 


signal peptidase 1 
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Best Blast Hit ID 
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Table I (continued) 


Function 


Antibiotic resistance factor 


pepdldoglycan biosynthesis 


beta-lactam resistance factor 


Unknown 


ribonuclease PH 


Unknown 


Ion Channel 


Unknown . 


ribonucleas 


Acetolactate synthase 


acetolactate synthase 


ABC transporter 


amino transferase 


cysteine-tRNA ligase 


signal recognition particle protein (ffh) 


305 ribosomal protein S1 6 


Unknown 


inoslne-urldine preferring nucleoside hydrolase 


oxidoreductase 


Unknown 


phosphate ABC transporter . 


phosphate transport system permease protein A- 


phosphate transport system permease protein 


Phosphate-blndingprotein 


two-component response regulator 
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381282 
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ORF666 


ORF669 


ORF671 


55 




Seq.lD 


SEQ. ID. No. 175 


SEQ. ID. No.176 


SEQ. ID. No.177 


SEQ. ID. No.178 


SEQ.ID. No.179 


SEQ. ID. No.1B0 


SEQ. ID. No.181 


SEQ. ID. No.182 


SEQ. ID. No.1B3 


SEQ.ID. No.184 


SEQ.ID. No.185 


SEQ.ID. No.186 


SEQ.ID. No.187 


SEQ. ID. No.188 


SEQ. ID, No.189 


SEQ.ID. No.190 


SEQ. ID. No.191 


SEQ.ID. No.192 


SEQ. ID. No.193 


SEQ.ID. No. 194 


SEQ.ID. No.195 


SEQ. ID. No. 196 


SEQ.ID. No.197. 


SEQ. ID. No.198 


SEQ. ID. No.199 
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E-val 


T 

ai 
o 
o 

CO 


CM 

•CO 

UJ 

o 
o 


2.00E-96 


3.00E-88 


I.00E-21 " 


4.00E-26 


9.00E-32 


2,00 E-33 


2,00E-65 . 


8.00E-53 


1 ,00E-26 


CO 

o 

f — 
lii 

o 
o 


7,00E-86 


C\l 
UJ 

o 
o 

CO 


1.00E-124 


2,00 E-52 


2.00E-27 


4.00E-29 


6.00E-75 


4.00E-79 


2.00E-25 


1.00E-60 


o 


1.00E-62 


o 


10 

is 




Best Blast Hit ID 


PIR:E70744 


PIR:1 40837 


PIR:ADECHF 


PIR:S52865 


TREMBL:SCBF4_ 


PIR:S23432 


PIR:S60902 


PIR:T34951 


TREMBLNEW:AP001511 163 


PIR:E70927 


PIR:C70671 ' 


PIR:T35650 


PIR:B70078 


PIR:T34664 • 


TREMBL:AF18149B_4 


PIR:T35802 


PIR:E81344 


TREMBLNEW:AE004747_11 


PIR:B70673 


PIR:T35643 


TREMBL:SCI41_22 


P.IR:E70769 


PIR:S37547 


CD 
"~l 

u> 
C3 

CO 
CM 

o 

CO 

s 

LU 

z 
_l 
m 

UJ 
CC 

1- 


TREMBLNEW:SC26G5_15 


20 
25 
30 
35 


Table I (continued) 


Function 


two-component sensor histidine kinase 


2-dehydro-3-deoxyphosphoheptonate aldolase 


2-dehydro-3-deoxyphosphoheptonate aldolase 


cysteine aminopeptldase 


Unknown 


•pyroglutamyl-peptidase I 


CDP-ribltol pyrophosphorylase 


Unknown 


protease 


tRNA (guanlne-NI)-methyltransferase 


methylase 


ATP-dependent DNA helicase 


Unknown 


methylated-DNA-proteln-cystelne methyltransferase 


L-aspartase 


polyamine ABC-transporter ATP-binding protein 


ABC transport system permease 


Unknown 


D-alanlne:D-alanine iigase 


glycerol-3-phosphate dehydrogenase 


peptidylprolyl isomerase 


Unknown 


H+-transportlng ATP synthase beta chain - 


ATP synthase gamma subunit 


H+-transporting ATP synthase 


40 




Stop 


413910 


416327 


417688 


419113 


420571 


422296 


422983 


423938 


424996 


427525 


427555 


428393 


434409 


436485 


440586 


442847 


444980 


445942 


447361 


448735 


456889 


456941 


458092 


459573 


460500 


45 




Start 


415238 


. 417556 


418818 


420453 


422145 


422970 


423855. 


424915 


. 426819 


426773 


428172 


431233 


433135 


435955 


439060 


444121 


44616.1 


447105 


448644 


449733 


455903 


457747 


459561 


460493 


462128 


50 




Code 


ORF672 


ORF676 


ORF677 


ORF678 


ORF679 


ORF6B1 


. ORF683 


ORF6B4 


ORF686 


ORF689. : 


ORF690 


ORF691 


ORF696 


o 

Lt 

CC 

o 


ORF709 


ORF713 


ORF718 


ORF720 


ORF721 


CO 
CM 

cc 
o 


ORF737 


ORF740 


ORF742 


ORF744 


ORF747 


55 




Seq.lD 


o 
o 

CM 

6 
Z 

g 
d 

UJ 

co 


SEQ. ID. No.201 


SEQ. ID. No.202. 


SEQ. ID. No.203 


SEQ. ID. No.204 


SEQ. ID. No.205 


. SEQ. ID. No.206 


SEQ. ID. No.207 


SEQ. ID. No.208 


SEQ. ID. No.209 


SEQ.ID. No.210 


SEQ. ID. No.211 


SEQ.ID. No.212 


SEQ.ID. No.213 


SEQ.ID. No.214 


. SEQ.ID. No.215 


SEQ.ID. No.216 


SEQ.ID. No.217 


SEQ.ID. No.218 


SEQ.ID. No.219 


SEQ.ID. No.220 


SEQ.ID. No.221 


SEQ.ID. No.222 


SEQ.ID. No.223 


SEQ.ID. No.224 
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E-val 


OJ 
LU 
O 
O 

">t" 


1.00E-21 


2.00E-88 


1,00E-22 


2.00E-27 


2.00E-47 


1.00E-174 


2.00E48 


6.00E-27 


5.00E-66 


6.00E-58 


1.00E-149 


1.00E-151 


1.00E-29 


1.00E-170 


1.00E-158 


to 
co 

LU 
O 

o 

CO 


7.00E-55 


1.00E-28 . 


1.00E-113 


LU 
O 
O 

03 


1.00E-129 


2.00E-25 


6.00E-39 


1;00E-21 


10 
15 




Best Blast Hit ID 


TREMBLNEW:SC26G5_14 


PIR:S37541 


TREMBLNEW:AP001515_14 


TREMBL:BPH6589_4 


TREMBL:LLHORF02_9 


PIR:B70965 


TREMBL:SC6A11_19 


PIR:T36335 


PIR:F81434 


PIR:JN0831 


PIR:C70790 


PIR:B71675 


TREMBLMLRHOJ 


PIR:B71153 


PIRT35817 


PIR:T35815 


co 

CO 
CO 

o 

a 

tr 

Q_ 


TREMBL:AF213822_7 


PIR:A70907 


TREMBL:AF129004_1 


PIR:B72254 


TREMBLNEW:AP0015i7_195 . 


PIRT36596 


PIR:F70913 


PIR:E70913 


20 
























































25 
30 
35 


Table I (continued) 


Function 


ATP synthase delta chain 


H-f-transportlng ATP synthase 


homoserine O-succlnyltransferase 


Bacteriophage portal protein 


Unknown 


integrase 


alpha-amylase 


inorganic pyrophosphatase 


Unknown 


two-component response regulator 


Endonuclease III 


vallne-tRNA ligase 


transcription termination factor Rho 


Unknown 


Glu-tRNA(Gln) amidotransferase chain B 


Glu-tRNA(Gln) amidotransferase chain A - 


Unknown 


Unknown 


Unknown 


copper-transporting ATPase 


glycerol uptake facilitator protein 


phosphotransferase system enzyme I 


ribosomal protein L9 


single-strand binding protein 


ribosomal protein S6 


40 




Stop 


462207 


463984 


46525B 


473587 


479179 


483079 


. 488831 


489553 


490371 


492145. 


492890 


497357 


500447 


502722 


508266 


509791 


511802 


513370 


514837 


517932 


518022 


518990 


521316 


.522091. 


522804 


45 




Start 


463040 


464793 


466289 


472562 


477113 


483906 


486594 


489062 


489790 


491348 


492207 


494520 


502513 


504332 


509762 


511329 


512647 


514770 


5.15115 


515227 


518753 


520666 


521759 


522744 


523094 


50 




Code 


ORF748 


ORF751 


ORF755 


ORF774 


ORF780 


ORF793 


ORF802 


ORF807 


ORF809 


ORF812 


ORF817 


ORF821 


ORF828 


ORF832 


ORF840 


ORF842 


ORF845 


ORF848 


ORF850 


ORF852 


' ORF853 • 


ORF855 


ORF857 


ORF859 


ORF861 


55 




Seq.lD 


SEQ.ID. No.225 


SEQ.ID. No.226 


SEQ.ID. No.227 


SEQ.ID. No.228 


SEQ.ID. No.229 


SEQ.ID. No.230 


SEQ.ID. No.231 


SEQ.ID. No,232 


SEQ. ID. No.233 


SEQ. ID. No.234 


SEQ. ID. No.235 


SEQ.ID. N6.236 


SEQ.ID. No.237 


SEQ.ID. No.238 


SEQ.ID. No.239 


SEQ. ID. No.240 


SEQ. ID. No.241 


SEQ. ID. No.242 


SEQ. ID. No.243 


SEQ. ID. No.244 


SEQ. ID. No.245 


SEQ. ID. No.246 


SEQ. ID. No.247 


SEQ. ID. No.248 


SEQ. ID. No.249 
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5 


E-val 


1.00E-104 


6,0OE-29 


1 ,00E-24 


o 


•1.00E-101 ■ 


1.00E-38 


7.00E-30. 


3.00E-40 


4.00E-79 


2.00E-47 


- 7.00E-2B 


.1.00E-144 


5.00E-69 


3,00 E-83 


2,00E-90 


1 ,00E-88 


1,00E-56 


2.00E-30 


o 
cn 

uu 
o 
o 

cm". 


o 


1.00E-57 


9.00E-35 


1.00E-167 


3.00E-35 


CD 

in 

LU 

o 
o 

cm" 


10 
»5. 


Best Blast Hit ID . 


CM- 
1 

CD 
CO 
UJ 
O 
CO 

:g 

UJ 

z 

_j 

CD 

5- 

UJ 

tr 
I— 


CO 

s 

CM 
CO 
X 

£ 

UJ 

z 
cc 

o_ 


PIR:H69735 


-» 

N 
CC 

CL 


PIR:F72395 


PIR:T30595. 


PIR:T36087 


"l 
cn 

CO 

co 
co 
o 
u_ 

'<■ 
li 

CO 

LU 
CC 

t- 


CO 

o 

CD 
CO 

CC 

CL 


Oi 
CO 

■<a- 

CL 


PIR:T31440 


PIR:T3659B 


PIR:T36597 


PIR:T34770 


TREMBL:CAJ10319 2 


PIR:T35664 


CO 

co' 

CO 

LU 
O 
CO 

UJ 

z 
_J 
m 

LU 
CC 
h- 


TREMBL:SPAJ6986_9 


TREMBLIHDIPEPJ 


PIR:T36009 


PIR:T35159 


PIR:T36011 


PIR:D70664 


SWISSPROT:PIP_LACLA 


PIR:E69115 


20 






















































S" 

9 
C 

c 

30 & 

CO 

XI 

35 


Function 


ribose-phosphate pyrophosphokinase. 


Unknown 


erido-1 ,4-beta-xylanase 


Unknown 


Unknown 


Sugar permeases 


.sugar transport system permease 


Transcriptional regulators of the LacI family 


ABC-type transport protein 


cobyric acid synthase 


UDP-N-acetylmuramyl tripeptide synthetase 


replicative DNA helicase 


DNA-damage-inducible protein f 


protein pi I uridylyltransferase 


ammonium transport protein 


prokaryotic docking protein 


transcription regulator 


glycosyl transferase 


dipeptidase 


Glucose-6-phosphate 1 -dehydrogenase 


Unknown 


glucose-6-phosphate 1 -dehydrogenase 


6-phosphogluconate dehydrogenase 


PHAGE INFECTION PROTEIN. 


phage Infection protein 


40 


Stop ' 


524870 


524952 


525916 


529737 


536261 


538240 


539079 


543120 


543141 


545394 


546249 


547721 


550895 


550935 


553209 . 


554872 


. 557072 


558130 


558210 


561700 


562722 


563791 


564101 


66669S 


572158 


45 


Start 


523860 


525833 


529629 


535559 


538453 


539055 


540002 


542101 


544958 


546143 


547718 


549247 


549303 


552758 


554501 


556131 


556236 


557099 


559943 


560036 


561700 


' 562862 


565552 


567372 


666695 


SO 


Code 


ORF864 


ORF865 


ORFB68 


ORFB73 


ORF881 


ORF885 


ORF887 


ORF892 


CO 
CD 
CO 
LL 
CC 
O 


ORF897 


ORF899 


ORF904 


ORF905 


ORF908 . 


ORF911 


ORF914 


ORF915 


ORF9.17 


ORF918 


ORF921 


ORF922 


ORF925 ; 


ORF927 


ORF931 


ORF934 




Q 


No.250 


No.251 


No.252 


No.253 


No.254 


No.255 


No.256 


No.257 . 


No.258 


No.259 


No.260 


No.261 


No.262 


No.263 . 


N0.264. 


No.265 


No.266 


No.267 


No.268 


No.269 


No.270 


No.271 


No.272 


No.273 


No.274 


55 


cr 
m 
to 


Q 


Q 


q 


9 


■9 ' 


q 


Q 


g 


9 


d 


Q 


d 


Q 


d 


d 


d 


d 


d 


9 ■ 


d 


d 


d 


9 


d 


9 






SEQ 


SEQ. 


SEQ 


SEQ 


SEQ 


SEQ. 


SEQ 


SEQ 


SEQ 


SEQ 


SEQ 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ 


SEQ. 


SEQ. 


SEQ 


SEQ. 


SEQ 


SEQ 


SEQ. 
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E-val 


2.00E-65 


2.00E-61 


9.00E-36 


2.00E-39 


4,00E-32 


5.00E-29 


6.00E-46 


1.00E-142 


1.00E48 


1.00E-163 


to 

OJ 
LU 

o 

o 
co" 


7.00E-31 


2.00E-25 


5,00E-41 


1.00E-105 


4.00E-52 


5,00 E-33 


6.00E-57 


8.00E-47 


o 


2.00E-72 


o 


3.00E-71 


3,00 E-39 


2.00E-86 






































OJ 
CVI 
CVI 




















10 

15 




Best Blast Hit ID 


PIR:D81263 


PIR:T47014 


PIR:A40182 


PIR:B71257 


PIR:E70063 


PIR:D81250 


& 
co 

CO 

o 

UJ 

-li 
m 

Ul 

tr 
t- 


PIR:T29435 


TREMBLNEW:AB032203. 


PIRT45422 


PIRNEW:G82131 


PIR;H70544 


PlR:T35894 


PIRNEW:E81981 


TREMBL1HDIPEPJ 


TREMBLNEW:AP001520. 


PIRT14971 ■ 


PIR:T36661 


PIR:T36663 


PIR:T36664 


PIR:T36588 


PIR:G70794 


PIR:A44846, 


PJR:T35383 


PIR:S42422 


20 
25 
30 
35 


Table I (continued) 


Function 


ABC transport system ATP-blnding protein 


ABC transport system permease protein 


Unknown 


Unknown 


Unknown 


Unknown 


Transposase 


Unknown 


transposase 


glutamyl-tRNA synthase 


Unknown 


Unknown 


Unknown 


formate tetrahydrofolate llgase 


dlpeptidase 


transposase 


probable transposase 


DNA polymerase 


thymidylate kinase 


DNA topolsomerase 1 


penicillin-binding protein 


Alpha-isopropyl malate synthase 


aspartate-semialdehyde dehydrogenase 


aspartokinase ■ 


aspartate kinase alpha chain 


40 




Stop 


573563 


575633 


578386 


579244 


582190 


583352 


585539 


587533 


594780 

• 


596683 


090009 


602113 


604473 


600909 


607686 


611799 


612581 


612690 


. 613847 


614745 


621883 


621961 


628025 


628113 


628735 


45 




Start 


574357 


576934 


579066 


581052 


582828 


582996 


586420 


593430 


593599 


598200 


600974 


602769 


603031 


607523 


609284 


610300 


611589 


613847 


614425 


617834 


619472 


623874 


626934 


628652 


629496 


50 




Code 


ORF941 


ORF945 


ORF949 


ORF951 


ORF956 


ORF959 


ORF965 


ORF972 


ORF975 


ORF983 


ORF987 


■ ORF991 


ORF994 


ORF998 


ORF1000 


ORF1004 


ORF1007 


ORF1010 


ORF1013 


ORF1015 


ORF1022. 


ORF1026 


ORF1032 


ORF1034 


ORF1035 


55 




g 


No.275 


No.276 


No.277 


No.278 


No.279 


No.280 


No.281 


No.282 


No.283 


No.284 


No.285 


CO 
CO 
CVI 

6 
Z 


No.287 


No.288 


OO 
CO 
CM 
d 
Z 


No.290 


No.291 


No,292 


No.293 


No.294 


No.295 


No.296. 


No.297 


No.298 


No.299 




cr 
a> 
CO 


SEQ.ID, 


SEQ.ID, 


SEQ. ID 


SEQ. ID 


SEQ, ID 


SEQ.ID. 


SEQ.ID. 


SEQ.ID. 


•SEQ. ID. 


SEQ. ID. 


SEQ, ID. 


SEQ. ID. 


SEQ. ID. 


SEQ. ID. 


SEQ. ID. 


SEQ.ID. 


SEQ. ID. 


SEQ.ID. 


SEQ. ID. 


SEQ. ID. 


SEQ.ID. 


SEQ. ID. 


SEQ.ID, 


SEQ. ID. 


SEQ. ID. 
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s 

10 

15 
20 
25 

to 

C 

c 
o 

30 & 
CD 

xt 

35 
40 


E-val 


4,00 E-67 


5.00E-41 


3.00E-70 


1 .OOE-115 


3.00E-21 


. 1.00E41 


1.00E-29 


2.00E-41 


1.00E-44 


Q 
CM 
UJ 
O 
O 

to 


2.00E-26 


CO 
CM 

UJ 
o 
o 

co 


o 


2.00E-41 


4.00E-21 


2.00E-27 


1..00E-137 


9.00E-27 


1.00E-141 


6.00E-96 


4.00E-98 


1.00E-114 


1 ,00E-37 


1.00E-149 


09-300' I. 


Best Blast Hit ID 


TREMBLAF179376J 


TREMBL:AF019629_2 


PIR:T35386 


TREMBL:SCD25_3 


PiR:H70861 


PIR:T36677 


< 

o 

UJ 

X 
DC 

CL 


TREMBLNEW:AP001515_225 


PIRIC75614 


PIR:S70209 


PIR:S76622 


PIR.PN0643 


PIR:JN0B30 


TREMBLNEW:SS0251 975_1 


PIR:T35672 


TREMBL:SCF43_17 


PIR:G70928 


TREMBL:SCF43_17 


TREMBL:AF105219_2 


PIR:D70B55 


TREMBLNEW:SC9A4_16 


PIR:S47743 


SWISSPROT:SCRR_STRMU 


TREMBL:AF065245_3 


TREMBL:AP000342_11 


Function 


Unknown '• 


flmbrla-assoclated protein 


recombination protein 


DNA polymerase III subunit 


Unknown 


secretory protein . 


Unknown function (dedA like protein) 


3-oxoacyl -[acyl-carrier-p rotein] reductase 


xanthine permease 


heat shock transcription regulator 


heat shock chaperone protein DNAJ-like . 


heat shock protein grpE 


dnaK-type molecular chaperone (Heat shock protein) 


alpha-xylosidase. 


transmembrane transport protein 


transcriptional regulatory protein, Lacl-famlly 


4-alpha-glucanotransferase 


transcriptional regulatory protein, Lacl-famlly 


alpha-1 ,4-glucosldase 


ketol-acldreductoisomerase 


ketol-acid reductolsomerase 


transmembrane transport protein 


SUCROSE OPERON REGULATORY PROTEIN 


sucP-like protein 


transposase 


. Stop 


629860 


633159 


.633162 


633790 


637695 


641511 


643758 


645418 


648919 


650544 


651146 


652397 


653053 


655916' 


658565 


660555 


664047 


665213 


667187 


667273 


668754 


670003 


671998 


674947 


677851 


45 


Start 


631140 


632029 


633761 


636699 


638822 


642575 


644480 


646218 


650268 


651128 


652162 


653053 


654930 


657994 


659278 


659548 


661813 


664182 


665376 


668331 


669803 


: 671520 


673038 


676470 


676B05 


50 


Code 


ORF1037 


ORF1040 


ORF1041. 


ORF1042 


ORF1045. 


ORF1052 


ORF1056 


ORF1059 


ORF1064 


ORF1067 


ORF1068 


ORF1072 


ORF1073 


ORF1077 


ORF1082 


ORF1085 


ORF1090 


ORF1091 


ORF1094 


ORF1096 


ORF1097 


ORF1098 


ORF1101 • 


ORF1107 


ORF1108 


55 


Seq.lD 


SEQ.ID. No.300 


SEQ.ID. No.301 


SEQ.ID. No.302 


SEQ.ID. No.303 


SEQ. ID. No.304 


SEQ. ID. No.305 


' SEQ. ID, No.306 


SEQ. ID. No.307 


SEQ.ID. No.308 


SEQ. ID. No.309 


SEQ.ID. No.310 


SEQ.ID. No.311 


SEQ.ID. No.312 


SEQ. ID. No.313 


SEQ. ID, NO.314 


SEQ. ID. No.315 


SEQ. ID. No.316 


SEQ.ID. No.317 


SEQ.ID. No.318 


SEQ.ID. No.319 


SEQ.ID. No.320 


CM 
CO 

d 
Z 

Q 

a 

Ul 

co 


SEQ. ID. No.322 


SEQ. ID. No.323 


SEQ. ID. No.324 
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E-val 


1.00E-40 


1.00E-158 


3,00 E-52 


9,00 E-34 


1.00E-35 


1.00E-165 


1.00E-115 


1 ,00E-64 


o 

LU 

O 
O 


CM 
CM 

LU 
o 
o 
in 


• 1.00E-96 


' 9.00E43 . 


7.00E-84 


8,00 E-23 


2.00E45 


2.00E-83 


1 .00E-86 


•1.00E-23 


CO 
CO 

lii 
o 
o 

CD 


6.00E-42 


1.00E-35 


1.00E-127 


7.00E-23 


2.00E-20 


2.00E-53 


10 
15 




Best Blast Hit ID 


,TREMBLNEW:SCG22_12 


PIR:S55274 


TREMBLNEW:AE004526 1 


SWISSPROT:SCRR_STRMU 


CO 

CM 
CO 

O 

5 

Ul 

z 
or 
o_ 


PIR:T36519 


CD 
CJ 

1 

a 

O 
_J 

■s 

_i 
m 

UJ 

CC 

\- 


PIR:F70549 


TREMBL:SCF15_2 


CO 

o 

CO 

o 
r~ 
X 
DC 
Q. 


CD 

Ol 
CO 

p 

CC 
Ol 


PIR:F72359 


PIR:B69722 


TREMBL:AB003158_1 ■ 


PIR:JQ0759 


PIR:JQ0760 


PIR:G81243 


PIR:C70619 


CM 

co 
co 

CM 

co 
or 
o_ 


PIR:D64900 


PIR:S66056 


TREMBLNEW:SC9E12 7 


PIR.T36712 


PIR:T36713 


PIR:H70699 


SO 
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Table I (continued) 


. Function 


transcription regulator, Lad family 


alpha-L-arablnofuranosldase 


potassium uptake protein 


Transcriptional regulators of the Lad family 


Unknown 


adenylosuccinate synthetase 


fructose-bisphosphate 


heat shock protein 


ferredoxin/ferredoxin--NADP reductase 


Unknown '. 


cation-transporting ATPase 


periplasmic serine proteinase 


queulne tRNA-ribosyltransferase 


hemolysin III 


restriction endonuclease 


methyltransterase 


peptide methionine sulfoxide reductase 


Unknown 


Unknown 


Unknown 


Unknown 


X-Pro dipeptidyl-peptldase 


Unknown 


Unknown 


phosphoprotein phosphatase 


40 




Stop 


662749 


683955 


685919 


687912 


690945 


692552 . 


695115 


695332 


696445 


698863 


. 699209 


.701780 


705531 


707651 


710264 


710277 


714565 


716076 


718646 


.720885 


723052. 


730425 


731311 


. 731871 


733570 


45 




Start 


683780 


685751 


. 687709 


688994 


692525 


693835 


694051 


696249 


697893 


698024 


701365 


703804 


, 704221. 


706971 


708786 


711545 


715545 


718646 


719290 


721991 


723768 


. 727984 


730613 


731344 


731879 . 


50 




Code 


ORF1124 


ORF1126 


ORF1127 


ORF1130 


.ORF1138 


ORF1141 


ORF1143 


in 
•>cr 

£ 
O 


ORF1147 


ORF1148 


ORF1150 


ORF1157 


ORF1162 


QRF1168 


ORF1172 


ORF1173 


ORF1178 


ORF1182 


ORF1188 


ORF1192 


ORF1198 


ORF1208 


ORF1210 


ORF1212 


ORF1213 


55 




Seq.lD 


ID. No.325 


ID. No. 326 


ID. No.327 


ID. No.328 


ID. No.329 


ID. No.330 


ID. No.331 


ID. No.332 


ID. No.333 


ID. No.334 


ID. No.335 


ID. Nb.336 


ID.No.337 


ID. No. 338 


ID. No.339. 


ID. No.340. 


ID. No.341 


D. No.342 


ID. No.343 


D. No.344 


ID. No.345 


D. No.346 


D. No.347 


D. No.348 


D.. No.349 








SEQ 


SEQ. 


SEQ 


SEQ 


SEQ. 


d 

LU 

co 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


a 

LU 
CO 


SEQ. 


a 

LU 
CO 


SEQ. 


SEQ. 


SEQ. 


2 

CO 


seq; 


SEQ. 


SEQ.- 


a , 

LU 
CO 


SEQ. 


G> 
jj 

Ji 


03S 


SEQ. 
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E-val 


6.00E-75 


2.00E-87 


1.00E-55 


• 1.00E-107 


8.00E-62 


8.00E-28. 


1.00E-28 


9.00E-24 


o 


1.00E-58 


5.00E-36. 


1.00E-91 


001-300' I 


3,00E-34 


3.00E-23 


5,00 E-65 


CO 

LU 
O 

o 

co" 


9.00E-66 


1.00E-36 


CM 

in 

LU 

o 
o 

in 


4,000-30 


1.00E-3B 


8.00E-68 


1.00E-29 


1.00E-162 


10 
15 




Best Blast Hit ID 


P|R:T36715 


PIR:F70699 . 


PIR:E70699 


PIR:T36717- 


PIR:T36720 


PIR:T36718 


PIR:T36722 


PIR:A70840 


PIR:S77252 . 


PIR:E75438 


PIR:D75261 


PIR:H69870 


SWISSPROTiCAPP ANASP I 


PIR:F75302 


CM 

*~l 
CM 
CM 
C3 
O 
CO 

LU 

z 
_J 

CO 

LU 
CC 

h- 


TREMBLNEW:AP001515_262 


SWISSPROTiTRXB^EUBAC 


TREMBL:AF016233_2 


PIR:E70804 


PIR:S76248 


TREMBL:AF068267_1 


CO 

o 

CO 

CC 
CL 


PIR:E71009 


PIR:D709B1 


TREMBLNEW:AP001514_107 
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Table I (continued) 


Function 


FtsW cell division protein 


pentalllln^blnding protein 


serine/threonine protein kinase 


probable serine/threonine protein kinase 


glutamine amidotransferase 


Unknown 


Unknown . 


Unknown . 


Phosphorylase . '• 


tryptophanyl-tRNA synthetase 


Unknown 


sodium/proline symporter 


PHOSPHOENOLPYRUVATE CARBOXYLASE 


ATP-dependent DNA helicase 


transcription regulator, Lad family 


permease 


THIOREDOXIN REDUCTASE 


alkyl hydrogen peroxide reductase 


icfA like carbonic anhydrase icfA 


hemolysin 


Unknown . 


Unknown 


aspartate transaminase 


transcriptional regulatory protein 


NADP-specific glutamate dehydrogenase 


40 




Stop 


735126 


736589 


737536 


739605 


739837 


740532 


741767 


743795 


745313 


750104 


750288 


752125 


758911 


762024 


763888 


. 766721 


766844 


768929 


770375 


771978 


772554 


776737 


779817 


781785 


783450 


45 




Start 


733570 


. 735126' 


7365B9 


737536 


740478 


741767 


742564 


744637 

— ;. ... 


747835 


749016 


752072 


753762 


756161 


763670 


. 764892 


765438 


768757 


769489 


769695 


770716 


772078 


' 775634 


781148 


781354 


782107 
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CM 
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-a- 
r- 

CM 
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o 

CO 
CM 


5 
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CD 
CM 


GO 

O) 
CM 


o 
o 
CO 


CM 
O 

co 


50 




O 

O 


ORF 


ORF' 


ORF' 


ORF' 


ORF; 


ORF' 


ORF' 


ORF' 


ORF' 


ORF 1 


ORF' 


ORFl 


ORF1 


ORF1 


Li- 
CC 

O 


ORF1 


LL. 

cr 
O 


ORF1 


ORF1 


ORF1 


ORF1 


ORF1 


ORF1 


ORF1 


ORF1 






Q 


No.350 


Nb.351 


No.352 


No.353 


No.354 


No.355 


No.356 


No.357 


No.35.8' 


No.359 


No.360 


No.381 


No.362 


No.363 


No.364 


No.365 


No,366 


No.367 


No.368 


No,369 


No.370 


CO 

d 
Z 


No.372 


No.373 


No.374. 
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d 
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9 


9 


9 
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d 


9 


9 


9 


9" 
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SEQ 


SEQ. 


SEQ. 


■ SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ, 


SEQ. 


SEQ, 


SEQ. 


SEQ. 

i — i 


SEQ. 
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E-val 


3.00E-67 


o 


o 


7,00 E-60 


2,00 E-88 


1.00E-106 


9.00E-33 


2,00 E-40 


3,00 E-33 


1.00E-76 


2.00E-68 


1.00E-82 


4.00E-44 


1,00E-28 


1.00E-162 


3.00E-26 


7.00E-62 


7.00E-36 


1.00E-33 


o 


1.00E-145 


1.00E-29 


1.00E-127 


1.00E-20 


10 
15 




Best Blast Hit ID 


PIR:D71089 


PIR:T10970 


SWISSPROT:GYRB_STRC0 


PIR:T,10967 


o 
oo 

s 

tr 

CL 


PIR:IQMCL 


PIR:T36570 


PIR:T36571 


CO 

1 

cn 

LO 

co 

LL 

< 
_i 
ca 

LU 

tr 
l- 


TREMBLAF187159 5 


PIRT36574. 


SWISSPROT:TRXB_MYCSM 


PIR:T36581 


PIR:E70600 


PIR:T36583 


PIR:F70603 


PIR:E70603 


TREMBL:SCL11_7 


PIRIE70648 . 


SWISSPROT:RIR1_MYCTU 


PIR:C70B61 


PIR:T44648 


TREMBLSRMSIKJ 


PIR:S52348 


20 
25 
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35 


Tablet (continued) 


Function 


Unknown 


DNA topoisomerase chain 


DNA GYRASE SUBUNIT B 


DNA repair and genetic recombination protein 


DNA-directed DNA polymerase III beta chain 


replication initiation protein dnaA 


SpoliJ like protein 


Unknown 


GLUCOSE INHIBITED DIVISION PROTEIN B 


ATPases Involved in chromosome partitioning 


partitioning or sporulation protein 


THIOREDOXIN REDUCTASE 


Unknown 


MutT/nudix family protein 


RNA nucleotidyltransferase . 


Unknown 


dimethyladenosine transferase 


serine/threonine protein kinase 


involved in RIBONUCLEOTIDE REDUCTASE FUNCTION 


RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE 
ALPHA 


ribonucleotide reductase subunit R2F 


glycosyl trsneferase (cps like) 


ABC-type sugar transport protein 


Unknown 


40 




Stop 


783833 


786722 


789441 


792179 


793445 


795305 


798964 


799621 


800438 


801659 


803020 


803332 


806617 


810608 


813396 


814163 


815110 


819239 


826943 


829254 


830558 


831797 


833284 


843461 


45 




Start 


785251 


789370 


791528 


793363 


794566 


796804 


797960 


799091 


799776 


800652 


801662 


804354 


808341 


812056 


811984 


815110 


816033 


821476 


826539 


827062 . 


829569 


830640 


832160 


841887 


50 




Code 


ORF1304 


ORF1309 


ORF1310 


ORF1314 


ORF1315 


ORF131B 


ORF1323 


ORF1325 


ORF1327 


ORF1328 


ORF1329 


ORF1331 


ORF1337 


ORF1342 


ORF1344 


ORF1347 


ORF1350 


ORF1358 


ORF1374 


ORF1376 


ORF137B 


ORF1379 


ORF1381 


ORF1390 


55 




Seq.lD 


SEQ. ID. No.375 


SEQ. ID. No.376 


SEQ.ID. No.377 


SEQ.ID. No.378 


SEQ. ID. No.379 


SEQ.ID. No.380 


SEQ. ID. No.381 


SEQ. ID. No.382 


SEQ. ID, No.383 


SEQ.ID. No.384 


. SEQ.ID. No.385 


SEQ.ID. No.386 


SEQ. ID. Np.387 


SEQ. ID, No.388 


SEQ. ID. No.389 


SEQ.ID. No.390 


. SEQ. ID. No.391 


SEQ. ID. No.392 


SEQ. ID. No.393 


SEQ. ID. No.394 


SEQ.ID. No.395 


SEQ.ID. No.396 


SEQ.ID. No.397 


SEQ.ID. No.398 
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E-val. 


1 ,00E-41 


1.00E-108 


2.00E-51 


OS-300'8 


8.00E-35 


CO 

r- 

LU 

0 
0 


1.00E-48 


2,00 E-96 


1.00E-141 


1.00E-179 


9.00E-23 


3.00E-67 


4.00E-78 


.0 


3.00E-92 . 


1.00E-25 


3.00E-80 


3.00E-65 


1.00E-135 


CO 
CO 

lii 
0 
0 
00 


co 
LU 
0 
o_ 


1.00E-151 


1.00E-69 


1.00E-108 


10 
15 




Best Blast Hit ID 


TREMBl:AP019629_2 


TREMBLNEW:SC5F8_10 


CO 

1 

< 

oo 
Q 
O 
CO 

LU 

z 
_l 
CQ 

LU 
CC 
(— 


TREMBLNEW:SC2D46_22 


PIR:S16567 


TREMBLN EW: AP001 51 5_249 


TREMBLNEW:AB032203_2 


PIR:T35077 ;. 


SWISSPROT:TAPE_ARTGO 


TREMBL:SCI8_25 . 


TREMBL:SCD25_30 


PIR:D69159 


TREMBL:REU0 10061.1 


PIR:S04781 


PIR:A70903 


PIR:D70660 ; 


PIR:T36025 


PIR:D70903 - ■ 


PIR.T36019 


PIR:A70916 


PIR:T44580 


■ 

V 

?^ 

CM 
CO 

X 

CO 
5 
Ui 
DC 
(— 


SWISSPROT:BRNQ_BACSU 


PIR:C70917 


20 
25 
30 
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Table I (continued) 


Function 


flmbrla-associated protein 


Unknown 


RNA methyltransferase 


permease 


beta-1,4-D-xylanase 


cation-transporting ATPase 


transposase 


cyclase involved in histldine biosynthetic pathway 


anfhranllate synthase component I ." 


ABC transporter ATP binding protein 


Unknown 


methyl coenzyme M reductase system, component A2 


transposase 


Excinuclease ABC chain A ■ 


excinuclease abc subunit c 


shikimate 5-dehydrogenase 


Unknown 


Unknown 


phosphogtycerate kinase 


triose phosphate Isomerase 


lactate dehydrogenase 


aminotransferase 


branched-chain amino acid transport system II carrier 
protein 


transaldolase 


40 




Stop 


844631 


847364 


847477 


849849 


850758 


853928 


855527 


856453 


858619 


860536 


861726 


862453 


865593 


878154 


880670 


881750 


882736 


883885 


885259 


886119 


887483 


88999S 


891415 


892881 


45 




Start 


843648 


845955 


848496 


848068 


849865 


850944 


854346 


855686 


. 856856 


858938 


861361 


864063 


866861 


875122 


878307 


880782 


881753 


882938 


883937 


885319 


886536 


' 888457 


892749 


893981 


50 




Code ; 


ORF1392 


ORF1399 


ORF1400 


ORF1401 


ORF1403 


ORF1405 


ORF1407 


ORF1410 


ORF1413 


ORF1415 ' 


ORF1418 


ORF1421 


ORF1427 


ORF1442 


ORF1443 


ORF1445 


ORF1447 


ORF1450 


ORF1453 


ORF1454 


ORF1457 


ORF1463 . 


ORF1467 


ORF1469 






Q 


CO 
CO 

oo 
b 
Z 


No.400 


No.401 


No.402 


No.403 ' 


No.404 


No.405 


No.406 


No.407 


No.408 


No.409 


No.410 ' 


No.411 


No.412 


No.41.3 


No.414 


No.41.5 


No.416 


No.417 


No.418 


No.419 


No.420 


No.421 


No.422 


55 




cr 
0) 
CO 


ri 


Q 


Q 


0" 
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ri 
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ri 


ri 


ri 
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ri 
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SEQ 


SEQ. 


SEQ. 


SEQ 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ.! 
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E-val 


o 


7.00E-65 


8.00E-69 


lO 

in 

LU 

o 
o 

co 


3.00E-35 


o 


1.00E-49 


3.00E-B4 


7.00E-32 


7.00E-26 


1.00E-113 


3.00E-32 , 


3.00E-92 


3.00E-98 


1.00E-166 


7.00E-64. 


1.00E-143 


5.00E-57 


1.00E-14B 


3.00E-34 


2.00E-28 


4,00 E-23 


4.00E-21 


2.00E-73 


10 
15 


Best Blast Hit ID 


PIR:T35162 


TREMBL:SCC77_22 


TREMBLSCC77_21 


CO 

1 

to 
(3 

C\J 

O 
CO 

LU 
2 
_J 

no 

LU 

oc 


PIR.S72871 


TREMBL:SCL2_21 


TREMBLSCL2 20 


PIR:F70500 • 


PIR:A70727 


TREMBLSCL2 9 


TREMBL:SCL2_8 


TREMBL:ECU82664_67 


■«* 

CO 

•5J- 
rr 

Q_ 


PIR.T45435 


TREMBL:AB003159_3 


TREMBL:MLCB1351_3 


PIR:H70504' 


TREMBL:AF2039.09_1 


TREMBL:SCI41_14 ' 


PIR:G70721 


TREMBLNEW:SC1A8A 4 


PIR.B70721 


TREMBLNEW:SC1A8A_9 


PIR:D70513. 


8 8 !S s 
Table I (continued) 


Function 


transketolase 


heat-lnduclble transcriptional repressor 


heat-shock protein DnaJ 


bacitracin resistance like protein 


Unknown 


threonine-tRNA synthetase 


Histidine triad protein (hit) 


Unknown 


probable crossover junction endodeoxyribonuclease ruvC 


holliday junction DNA helicase 


holliday junction DNA helicase 


Adenine/guanlnephosphoribosyltransferases 


succinyl-CoA synthetase beta subunit 


succinyl-CoA synthetase alpha subunit 


5'-phosphorlbosyl-5-aminoimidazole-4-carboxamide 
formyltransfer ase 


pseudouridylate synthase 


GTP-binding protein 


UDP-glucose pyrophosphorylase 


helicase 


hypothetical protein Rv1830 - Mycobacterium 


Unknown 


Unknown 


>N 

oo 

JC 
D_ 
10 

o 

CL 

co 

OJ 

13 

xz 

CL 
to 
O 

sz 

CL. . 
CO 

e . 

CD 

i 

S 

CD 
CO 

>> S 
o to. 

.9? 2 
xj js 

oL <2 

Q § 
O & 


ATP-phosphoribosyl transferase 


40 


Stop 


894105 


897700 


898901 


900767 


900930 


904610 


905334 


906228 


906818 


907502 


908566 


909710 


.911009 


911920 


914904 


917185 


919311 • 


921195. 


926180 


928449 


929182 


929632, 


931768 


932399 


45 


Start 


896210 


8965B5 


897759 


899886 


901919 


902580 


904753 


905476 


906237 


906879 


. 907505 


909132 


909810 


911012. 


913270 


916418 


917185 


919669 . 


923592 . 


929078 


929622 


930450 


932.460 


933247 


50 


Code 


ORF1470 


ORF1473 


ORF1476 


CO 

-3- 

LL 

oc 

o 


ORF1482 


ORF1486 


ORF1488 


ORF1490 


ORF1491 


ORF1493 


ORF1494 


ORF1497 


ORF1499 


ORF1501 


ORF1506 


ORF1512 


ORF1514 


ORF1515 


ORF1523 


ORF1529 


ORF1530' 


ORF1531 


ORF1535 


ORF1536 


55 


Seq.lD 


SEQ.ID. No.423 


SEQ.ID. No.424 


SEQ.ID. No.425 


SEQ.ID. No.426 


SEQ. ID. No.427 


SEQ. ID. No.428 


SEQ. ID. No.429 


. SEQ. ID. No.430 


SEQ. ID. No.431 


SEQ. ID. No.432 


SEQ.ID. ID. No.433 


SEQ. ID. No.434 


SEQ.ID. No.435 


SEQ.ID. No.436 


SEQ.ID. No.437 


SEQ.ID. No.438 


SEQ.ID. No.439 


SEQ.ID. No.440 


SEQ. ID. No.441 . 


SEQ.ID. N6.442 


SEQ. ID. No.443 


SEQ. ID. No.444 . 


CO 

d 
2 

d 
d 

LU 
CO 


SEQ.ID. No.446 
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Table I (continued) 
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Table I (continued) 


Function 


Unknown 
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ABC transporter (ATP-binding protein) 


two-component response regulator 
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carbon starvation protein 
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uracil transport protein 


Unknown 


serine-threonine protein kinase 


macrolide-efflux pump 


multidrug resistance protein 


gtp-binding protein 


pyrroline-5-cart)oxylate reductase 


prolyl aminopeptidase 
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Table i (continued) 
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Table I (continued) 


Function 


3-phosphoshiklmate 1 -carboxyvlnyltransferase 


resolvase 


transposase 


lactose transport protein 


beta-galactosidase 


3-methyladenlne-DNA glycosylase 


glycosyl hydrolase 


DNA polymerase I 


response regulator 


pyruvate kinase 


transmembrane transport protein 


excinuclease ABC chain B 


Unknown 


ribosomal protein S1 


methylenetetrahydrofolate dehydrogenase 


ABC transporter, ATP-binding protein 


permease of ABC zinc transporter 


Unknown 


transcription factor 


1,4-alpha-glucan branching enzyme . 


phosphate response regulator protein phoP 


two-component system sensor kinase 


transcription regulator 


cell division-related protein 


Sensory transduction histldlne kinase 
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Best Blast Hit ID 


TREMBL:AB003158J 


PIR:F70894 


PIR:546228 
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Table I (continued) 


Function 


hemolysin-like protein 


transcription elongation factor 


peptidylprolyl isomerase 


L-serlne dehydratase 


exopolyphosphatase 


Unknown 


phosphopyruvate hydratase 


oxidoreductase 


transcriptlon-repair coupling factor ( 


peptidyl-tma hydrolase 


phosphinothrlcin acetyltransferase 


Unknown 


Unknown . 


gamma-glutamyl phosphate 


glycine hydroxymethyltransferase 


threonine synthase 


cation-transporting P^VTPase 


ABC transporter, ATP-binding prote 


DNA repair protein 


Unknown 


potassium uptake protein KtrB 


potassium uptake protein KtrA 


Hemolysin A homolog 


sugar phosphatase 


Unknown 
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Best Blast Hit ID 
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Function 


tyrosyl-tRNA synthetase 


molybdopterin biosynthesis protein 


thiamin biosynthesis protein 


argininosuccinate lyase 


argininosuccinate synthase 


arglnine repressor 


ornithine carbamoyltransferase 


acetonitrile aminotransferase 


acetylgiutamate kinase 


glutamate N-acetyltransferase 


N-acetyl-gamma-giutamyl-phosphate reductase 


phenylalanyl-tRNA synthetase beta chain 


phenylalanyl-tRNA synthetase alpha chain 


rRNA methylase 


cation ABC transporter (ATP-bindIng protein) 


peptidase 


dihydrollpoamide dehydrogenase 


glutamine synthetase I 


GALACTOSIDE O-ACETYLTRANSFERASE 


endonuclease III related protein 


Unknown .. 


exclnuolease ABC chain A 


alcohol dehydrogenase (Zn-dependent) 


translation elongation factor EF-Tu 


translation elongation factor EF-G 
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1283591 
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1287339 


1288348 


1289633 
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ORF2120 


ORF2122 


ORF2123 


ORF2125 


ORF2126 


ORF2127 


ORF2128 


ORF2133 


ORF2135 
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transcriptional regulator 


arablnosldase 
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Asparaginase 


aminotransferase 


hisH-like amidotransferase 
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amino acid transporter 
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DNA helicase recQ 


CYSTATHIONINE GAMMA-SYNTHASE. 


cystathionine beta-synthase 


oligopeptide iransport protein 


oligopeptide transport ATP-blnding protein 1 


PEPTIDE ABC TRANSPORTER PERMEASE 


oligopeptide . 


probable oligopeptide binding protein 
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solute-bindlng lipoprotein 


solute-binding lipoprotein • 
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Table 1 (continued) 
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Table I (continued) 
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Table I (continued) 


Function 


2-hydroxyhepta-2,4-diene-1 ,7-dioate isomerase 


transcriptional regulator 


histldine ammonia-lyase 


endopeptidase CIpB (ATP-binding.chain B) 
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Best Blast Hit ID 
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TREMBLNEW:AP001509 


PIRNEW:A82130 


PIR:B69632 
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TREMBLNEW:SC7E4 33 


PIR:C70845 


TREMBLSCI41 38 


PIR:A72289 


PIR:H69470 


TREMBLECD720 13 


PIRJN0491 


TREMBL:SCC88 25 


PIR:T35661 


PIR:T34959 . 


PIRT42015 


PIR:540823 


PIR:S40823 


PIR:D70898 


PIR:T35087 


PIR:T09633 


PIR:T36835 


P.IR.T36833 


PIR:D70503 


Function 


transcription regulator, Lacl family 


Unknown 


unknown 


transcriptional regulator, ROK family 


glucose kinase 


Transcriptional regulators of NagC/XylR family 


glucosamine 6-phosphate isomerase 


N-acetylglucosamine-6-phosphate deacetylase 


oligopeptide binding protein 


oligopeptide ABC transporter, permease protein 


dlpeptide ABC transporter, permease protein (dppC) 


ABC transporterATP-binding protein 


X-Pro aminopeptidase 


folylpolyglutamate synthase 


chromosome segregation SMC protein 


U D P-N-acetylmuramoylalany l-D-glutamyl- 
2,6-diaminoplmelate ligase 


RNA polymerase slgma factor 


aldose epimerase family protein 


Unknown 


penicillin tolerance protein LytB 


Unknown ' 
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translation initiation factor IF-3 . 


ribosomal protein L20 


integrase/recomblnase 


Stop 


1661334 


1661462 


1662764 


1665242 


1667591 


1667747 


1670015 


1871354 


1673268 


1674524 


1675695 


1677408 
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1683621 
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1694969 


1695015 


1695618, 
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1700490 


1701531 
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1662757 


1664059 
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1666680 
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1669206 


1669981 
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1674646 


1675702 


1679626 


1682035 


1683685 


1690151 


1690269 


1692608 


1693689 


1693914 


1695449 


1696673 


1699247 


1700110 


1700608 
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ORF2693 


ORF2695 


ORF2697 


ORF2701 


ORF2704 


ORF2706 


ORF2709 


ORF-2710 


ORF2713 
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ORF2753 
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SEQ.ID. No.832 
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Best Blast Hit ID 


TREMBL:U73183_1 


PIR:T36875 
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Table I (continued) 


Function , 


ABC transporter ATP binding subunit 


partitioning or spomlation protein . 


Unknown' 


Unknown 


MutT/nudix family protein 


quinolinate synfhetase A 


L-aspartate oxidase 


nicotinate-nucleotide pyrophosphorylase 


cystelne desulfurase 


transport protein 


GTP-blnding translation elongation factor 


oxidoreductase 


Integrase/recomblnase 


peptide transport system secreted peptide 


peptidetransport system permease - 


oligopeptide ABC transporter (permease) 


peptidetransport system ABC-transporter 


exodeoxyribonuclease 


Unknown 


RNA methyjtransferase 


Integral membrane ATPase 


aconitate hydratase 


two-component response regulator 


Unknown 
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Stop . 


1704505 


1705606 


1706539 


1707268 


1708227 


1709568 


1711287 


1712184 


1713434 


1714815 


1717061 


1719725 


1721302 


1723195 


.1724227 


1725444 


1727476 


1728441 


1730137 


1731395 


1734706 


1737526 


1741469 


1745715 
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1701662 


1704710 


1705628 


1706555 


1707403 


1708291 


1709659 


1711294 


1712178 


1713469 


1715133 


1718661 


1720232 


1721558 


1723499 


1724443 


1725470 


1727584 


1729382 


1730091 


1732196 


1734830 


1742368 


1744819 
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ORF2757. 


. ORF2759 


ORF2761 


ORF2762 


ORF2764 


. ORF2767 
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ORF2769 


ORF2770 


ORF2772 


ORF2776 


ORF2778 


ORF2784 


ORF2788 


ORF2790 


ORF2793 


ORF2795 


ORF2797 


ORF2798 


ORF2803 


ORF2806 


ORF2810 


ORF2611 . 


ORF2820 


ORF2826 
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SEQ. ID. 


SEQ.ID. 


SEQ. ID. 


SEQ. ID. 


SEQ. ID. 


SEQ. ID. 


SEQ. ID. 


SEQ.ID. 


SEQ. ID. 


SEQ. ID. 
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Best Blast Hit ID 


TREMBLNEW:AEQ047B1_1 


TREMBLNEW:AP001513_158 


PIR:T35107 
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PIR:T35683 
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h- 


PIR:C70592 


PIR:S71922 


PIR:T35529 


TREMBL:SCL24_2 


TREMBLNEW:AF233851_2 


TREMBLSGCRTBJ 


PIRT35194 


PIRT35196 


TREMBLNEW:AE004621 9 
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PIR.T35872 


PIR:T35880 


PIR:T34820 
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PIRNEW:T51359 


PIR:E71174 


Function 


Unknown 


GTP-pyrophosphoklnase 


Unknown 


tRNA delta(2)-isopentenylpyrophosphate 


ftsK homolog - DNA translocase 


CDPdiacylglycerol--glycerol-3-phosphate 
3-phosphatldyltransferase 


RecA protein 


Unknown 


preproteln translocase secA 


anthranllate phosphoribosyltransferase : 


acyltransferase 


serineAhreonine-specific protein kinase 


po lyp reny I synthase - like protein 


transcription initiation factor sigma 


DNA gyrase chain B 


ribokinase 


ATP-dependent helicase 


DNA gyrase 


Unknown 


deoxyurldlne 5'-trlphosphate nucleotldohydrolase 


GTP pyrophosphokinase 


Transposase 


peptldylprolyl isomerase 


Unknown 
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1746871 


1746910 


1749230 


1750227 


1754131 


1754943 


1757902 


1760037 


1763093 


1765393 

... 


1767242 


1767300 


1769717 


' 1773289 


1775665 


1778094 


1782843 


1783793 


1788035 


: 1790064 


1792522 


1792646 


1794112 


1796076 
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1745858 


1747749 


1747791 


1749061 


1751033 


1754287 


1756712 


1759378 
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1764350 
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Best Blast Hit ID 


PIR:F70685 


PIR:A75272 
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SWISSPROT:Y4ME_RHISN 


PIR:T36167 


PIR:H41662 


TREMBL:AF121000_6 


PIR:F70925 


PIR:B70846 


TREMBL:AB003154_2 


PIR:E70937 


PIR:C70631 


PIR;T34809 


PIR:D70925 
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Table I (continued) 


Function 


CO 
CO 

.co . 
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0) 

15 

CD 

CD 
O 

x: ■ 

D. 
CO 

O. 

_c 

CL 


transport protein 


glutamate-binding periplasmic protein 


leucyl-tRNA synthetase 


competence-like protein 


Unknown 


Unknown 


Unknown 


O-sialoglycoprotein endopeptidase - 


Integrase/recomblnase 


Unknown 


DNA topoisomerase III topB 


type II site-specific deoxyribonuclease 


. site-specific DNA-methyltransferase (cytosine-specific) 


Unknown 


Unknown 


aggregation protein precursor 


plasmid partition protein 


Unknown 


Isocltrate dehydrogenase 


IMP dehydrogenase 


long-chain-fatty-acid-CoA llgase 


polypetide deformylase 


ribosomal protein S2 


translation elongation factor EF-Ts 
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1799924 


1800989. 


1801930 


1804936 


1807627 


1809113 


1810151 


1810738 


1813292 


1814894 


1822820 


1828001 


1829231 


1830545 


1833872 


1837642 


1858430 


1866103 


1872032 


1873996 


1876462 


1879293 


1879788 


0660881 


1881920 
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1799082 


1B00042 


1801007 


1801976 


1805942 


1807770 


1809279 


1810175 


1812252 


1813962 


1824277 



1825368 


1830496 


1831513 


.1835344 


1839447 


1863238 


1866693 


1871346 


1875342 


1875134 


1877209 


1879303 


1880103 


1881072 
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ORF2965 


ORF2968 


ORF2975 


ORF2984 


ORF3031 


ORF3041 


ORF3055 


ORF3059 


ORF3060 


ORF3062 , 


CO 
CO 

o 

CO 

u_ 
tr 
O 


ORF3066 


ORF3067 






Q 


CO 
CO 
CO 

6 
2 


N6.887 • 


No.888 


No.889 


No.890 


No.891 


No.892 


No.893 


No.894 


No.895 


CO 
CD 
CO 

d 
2 


No.897 


No.898 


No.899 


No.900 


No.901 


No.902 


No.903 


No.904 


No.905 


No.906 


No.907 


No.908 


CD 
O 

o> 
d 
2 


No.910 


55 




cr 

0) 

to 


Q 


g 


Q 


d 


Q 


Q 


Q 


9 


9 


Q 


Q 


d 


d 


ci 


ci 


ci 


d 


d 


d 


d 


d 


d 


d ' 


d 


d 






SEQ. 


SEQ. 


SEQ. 


SEQ 


SEQ, 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ: 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ, 


SEQ, 


SEQ. 


SEQ. 


SEQ, 



41 . ' 

i 

> 

ISDOCID:<EP 12271S2A1J_> 



EP1 227 152 A1 



5 




E-val 


7.00E-74 


' 5.00E-55 


6.00E-27 


1.00E-113 


3.00E-37 


CO 

f 

UJ 
o 
o 


4.00E-72 


2.00E-55 


p 


1.00E-21 


4.00E-28 


2,00 E-50 


6.00E-48 


1.00E-163 


8.00E-76 


4.00E-31 


o 

CVJ 
LU 

o 
o 

cvi" 


6.00E-24 


1.00E-129 


1.00E-150 


o 


LU 

O 
O 


o 

CM 

LU 
o 
o 






















































10 
15 




Best Blast Hit ID 


PIR:T34811 


PIR:T35471 


PIR:T35470 


PIR:T35453 


TREMBLNEW:AE004833 


TREMBLNEW:AB032203 


PIRT36613 


PIR:E70040 


TREMBL:AF1 24596 1 
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PIR:B42400 
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PIR:A41707 


PIR:D72386 
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TREMBL:SC4A7_14 


TREMBLAF1 26429 1 


PIR:T42208 


PIR:S55505 


TREMBL:AF052750_3 


PIR:H69735 
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Table I (continued) 


Function 


uridylate kinase 


ribosome recycling factor 


phosphatidate cytidylyltransferase 


Unknown 


pfpl like protease 


ransposase 


dCTP deaminase 


Na+/H+ antiporter 


alpha-galactosldase 


transcriptional repressor 


sugar transport system (sugar-binding protei 


membrane protein 


multiple sugar-binding transport system pern 


ollgp-1 ,6-glucosldase 


threonine dehydratase 


SIR2 family transcription regulator 


transcriptional regulator 


: Unknown ' 


acyl-CoA carboxylase complex A chain 


proplonyl-CoA carboxylase complex B chain 


fatty-acid synthase 


gene: "tnpA ; product: "transposase"; 


endo-1 ,4-beta-xylanase xynD 


40 




Stop 


1882834 


1883462 


1884471 


1885851 


1885858 


1887793 


1889584 


1893217 


1898285 


1898456 


1901121 


1902072 


1902957 


1906193 


1907726 


1907878 


1916519 


1916632 


1923762 


1925377 


1934934 


1942971 


1947220 
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Start 


1882097 


1882914 


1883386 


1884685 


1886403 


1886612 


1890162 


1895286 


1895982 


1899661 


1899592 


1901146 


1902094 


1904376 


1906482 


1908525 


1915770 


1917495 


1921801 


1923758 


1925419 


1944074 


1944695 


50 




Code 


ORF3069 


ORF3071 


ORF3072 


ORF3074 


ORF3075 


ORF3078 


ORF3083 


ORF3091 


ORF3095 


ORF3097 


ORF3098 


ORF3101. 


ORF3103 


ORF3107 


ORF3108 


ORF3109 


ORF3128 


ORF3129 


ORF3139 


ORF3140 


ORF3141 


ORF3162 


ORF3168 


55 




Seq.lD 


SEQ.ID. No.911 


SEQ.ID. No.912 


SEQ. ID. No.913 


SEQ.ID. No.914 


m 

OS 

d 
2 

9 
d 

UJ 
CO 


SEQ.ID. No.916 


SEQ. ID. No.917 


SEQ. ID, No.918 


SEQ.ID. No,919 


SfcQ. ID. NO.920 


SEQ.ID. NO.921 


SEQ. ID. No.922 


SEQ, ID. No.923 


SEQ. ID. No.924 


SEQ.ID. No.925 


SEQ.ID. No.926 


SfcQ; ID. No.927 


SEQ.ID. No.928 


SEQ.ID.No.929. 


SEQ.ID. No.930 


SEQ. ID. No;931 


SEQ. ID, No.932 


SfcQ. ID. N0.933 
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E-val 


9.00E-95 
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2,00 E-81 


CO 
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LU 
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o 
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2.00E-23 • 
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LU 
O 
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1.00E-30 


1.00E-65. 


2.00E-42 


1.00E-119 


6.00E-55 


8.00E-47 


- 

UJ 

o 
o 

eg 


8.00E-30 


1,00E-139" 


1.00E44 


1.00E-137 . 


5.00E-20 


o 


4.00E-51 


3.00E-56 


5.00E-25 


CO 

LU 
o 
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Best Blast.Hit ID 


PIR:T30909 


PIR:T10932 


TREMBLNEW:AP001509_46 


PIR:S32238 


TREMBL:SCE87_34 


PIR:F70601 


CO 

eg 

00 

o 

LL. 

< 

li 
CO 
S 

LU 

tr 
i- 


PIR:T21246 


PIR:E70743 


PIR:G70593 


PIR:B70876 


PIRNEW:A81861 


TREMBL:SCM1.0_29 


PIRNEW.F82068 


PIR:G70928 


PIR:T35564 


CO 

5. 

CO 

O 
to 
li 
CO 

LU 
DC 


TREMBL:AB'009593_3 


SWISSPROT:ADH2_ENTHI 


PIR:H70641 


PIR:A70642 


PIR:T45366 


PIR:C70642 


Function . 


endo-1 ,4-beta-xylanase 


polyribonucleotide nucleotidyltransferase 


Unknown 


ribosomal protein ■ 


CD 
CO 
CO 
Ul 

6 
u 

e- 

CD 

o 

e 
I 

co 

v. 

cu 
as 

r 

o 

.co 


Unknown ' 


chaperonin groES ■ 


cystathionine gamma-synthase homolog 


UDP-N-acetylpyruvoylglucosamlne reductase 


catlonic amino acid transporter- 


aminotransferase' 


DNA-DAMAGE-INDUCIBLE PROtEIN PimpB/mucB/ 
samB family protein 


Unknown 


dehydrogenase 


4-ALPHA-GLUCANOTRANSFERASE 


ribosomal protein 


glycosyl hydrolase 


N-acetylglucosamlhe repressor 


ALCOHOL DEHYDROGENASE 2 


ribosomal protein L3 


ribosomal protein L4 


ribosomal protein L23 


ribosomal protein L2 


Stop 


1950319 


1955024 


1955457 


1.958720 


1966046 


1967511 


1969482 


1970162 


1973656 


1975334 


1977065 


1977098 


1981420 


1982510 


1986297 


1987658 


1987755 


,1990292 


1994723 


1997283 


1997946 


1998248 


1999115 


Start 


1947446 


1952286 


1956848 


1958007 


1966792 


1966819 


1969.192 


1971682 


1972559 


1973811 


.1975833 


1978441 


1980080 


1981497 


1984135 


1987170 


1990292 


1991482 


1991997 


1996645 


1997293 


1997955 


1998288 


Code 


ORF3172 


ORF3181 


ORF3184 


ORF3189 


ORF3205 


ORF3206 


ORF3211 


ORF3214 


ORF3221 


ORF3222 


ORF3225 


ORF3226 


ORF3231 


ORF3234 


QRF3243 


ORF3247 


ORF3248 


ORF3250 


ORF3254 


ORF3259 


ORF3260 


ORF3261 


ORF3262 


Seq.lD 


SEQ. ID. No.934 


: SEQ.ID. No.935 


SEQ. ID. No.936 


SEQ. ID. No.937 


. SEQ. ID. No.938 


SEQ. ID. No.939 


SEQ.ID. No.940 


SEQ.ID. No.941 


. SEQ.ID. No.942 


SEQ.ID. No.943 


SEQ. ID. No.944. 


SEQ.ID. No.945 


SEQ.ID. No.946 


SEQ.ID. No.947 


" SEQ. ID, NO.948 


SEQ.ID. No.949 


SEQ.ID. No.950 


SEQ.iD. No.951 


SEQ.ID. No.952 


SEQ.ID. No.953 


SEQ.ID. No.954 


SEQ.ID. No.955 


SEQ.ID. No.956 
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E-val 


2,00 E-42 


4.Q0E-74 


5.00E-48 


7.00E-68 


1.00E-46 


1.00E-37 


4.00E-50 


8.00E43 


1.00E-137. 


2.00E46 


1.00E-134 


8.00E-36 


6.00E-54 


1.00E-102 


4.00E-21. 


1.00E-158 


1.00E-39 


5.00E-78 


o 


1.00E-24 


4.00E-53 


8,00 E-35 


1.00E-111 


2.00E-35 


1 ,00E-57 


10 
15 




Best Blast Hit ID 


PIH:T45368 


PIR:F70642 


PIR:G70642 


PIR:S29884 


PIR:S50001 


PIR:S29887 


SWISSPROT:RS5_STRCO 


o 

CD 

co 

Ol 
CM 

CO 
CC 

CL 


PIR:JC42B8 


PIR:S17070 


PIR:F70565 


PIR:T35559 


PIR:T35560 


PIR:C75323 


PIR:F75328 


PIR:S55274 


CM 

cm' 

C\ 

C3 
O 
CO 

UJ 
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_l 
m 
5' 
in 
tr 
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PIR:D70588 


PIR:B70694 . 


PIR:T35987 


PIR:T35986 


PIR:T35984 


PIR:T36362 


PIR:G69180 


PIR:H64756 


20 

25 
30 
35 


Table I (continued) 


Function 


ribosomal protein 


ribosomal protein S3 


ribosomal protein L16 rpIP 


Ribosomal protein L5 


ribosomal protein L6 


ribosomal protein L1 8 


30S RIBOSOMAL PROTEIN S5. 


Ribosomal proteiii 


preprotein translocase 


adenylate kinase 


DNA-directed RNA polymerase alpha chain 


ribosomal protein L17 


pseudouridylate synthase 


Unknown 


Unknown 


alpha-L-arablnofuranosidase 


Transcriptional regulators of the Lad family 


transcription termination-antitermination factor 


translation Initiation factor jF-2 


ribosome-blnding factor A 


tRNA pseudouridlne synthase . ■ 


probable riboflavin kinase (FAD synthetase) 


DNA repair protein 


ribose 5-phosphate isomerase 


mercury(ll) reductase 


40 




Stop 


1999409 


2000591 


2001017 


2002900 


2004132 


2004505 


2005233 


2005879 


2007490 


2008220 , 


2010840 


2011473 


2011561 


2014922 


2015929 


2018307 


2019883 


2021927 


2025061 


2025687 


2026852 . 


2028074 


2028095 


2030827 


2033.179 


45 




Start 


1999134 


1999791 


2000601 


2002331 


2003596 


2004137 


2004535 


2005430 


2006156 


2007663 


2009848 


2010943 


2012469 


2012670 


2015132 


2016508 


2018843 


2020863 


2022200 


2025217 


2025578 


2026953 


2029630 


2031522 


2034810 
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Code 
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ORF3265 


ORF3266 


ORF3270 


ORF3273 


r- 
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ORF3276 
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ORF3280 


ORF3282 


ORF3288 


ORF3289 


. ORF3290 
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o> 
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O 


ORF3298 


ORF3301 


ORF3304 


ORF3305 


ORF3306 


ORF3308 


ORF3310 


ORF3312 


ORF3316 


o 

CM 

co 

tr 
O 
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Seq.lD 


SEQ. ID. No.957 


SEQ. ID. No.958 


SEQ.ID. No.959 


SEQ.ID. No.960 


SEQ. ID. No.961 


SEQ. ID. No.962 


SEQ. ID. No.963 


SEQ. ID. No.964 


SEQ. ID. No.665 


SEQ. ID. No.966 


. SEQ.ID. No.967 


SEQ.ID. No.968 


SEQ.ID. No.969. 


SEQ. ID. No.970 


. SEQ. ID. No.971 


SEQ. ID. No;972 


SEQ.ID. No.973 


SEQ. ID. No.974 


SEQ.ID. No.975 


SEQ.ID. No,976 


SEQ, ID. No.977 


SEQ. ID. No.978 


SEQ. ID. No.979 


. SEQ. ID. No.980 


SEQ.ID. No.981 • 
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E-val 
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1.00E-26 
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1 ,00E-46 


1,00E-110 


3.00E-20 


1.00E-109 
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5.00E-74 


1.00E-30 
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Best Blast Hit ID 
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5 
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O 
CO 

_i 

m 

s 

Ul 

cc 

h- 


TREMBL:AF045552_3 


TREMBL:AF045481_1 


PIR:S47216 


PIR:T36067 


TREMBL:SCF91_20 . 


TREMBL:SC6D11_6- 
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5 

CO 

O 

CO 

li 

03 

Ul 

DC 
1- 


PIR;S77134 


TREMBL:AF082008_2 


PIR:D6962B 


PIR:T35501- 


TREMBL:ECU73857_11'5 


PIR:G7Q954 


PIR:A70546 . 


PIR:S30886 


PIR:G70533 


CM 

< 
CO 

a 
o 

CO 
Ul 

z 

_J 
ca 

Ul 

CC 

■1- 


PIR:T45349 . 


TREMBL:STH243106_4 


PIR.C70881 


TREMBL:AF006616_1 


PIR:T37174 


PIR:G69592 


PIR:G65081 


SO 
25 
30 
S3 


Table I (continued) 


Function 


ph osphoglucomutase 


D-xylose proton-symporter 


glucose permease 


transcription antiterminator 


seryl-tRNA synthase 


sugar-binding protein 


ABC transporter sugar permease 


ABC transporter sugar permease 


Unknown 


galactbse-1 -phosphate uridylyltransferase 


UDPglucose 4-epimerase 


two-component system regulator 


chloramphenicol resistance protein homolog 


lysine-tRNA ligase 


1 ,4-dlhydroxy-2-naphthoate octaprenyltransferase 


phosphoglycerate mutase 


phosphate transport system regulatory protein PhoU-like 


Sensory transduction histidine kinases 


phosphoserina aminotransferase 


Unknown 


thymidylate synthase 


dihydrofolate reductase 


protein-tyrosine-phosphatase 


branched-chaln amino acid transport protein 


Unknown 


40 




Stop 


2037427 


2039190 


2043612 


2044469 


2047194 


2049516 


2050678 


2051628 


2055210 


2056804 


2057895 


2057949 


2067535 


2070902 


2071929 


2072729 


2073094 


2075147 


2075663 


2078149 


2082683 


2083455 


2084097 


.2084611 


2085851 


45 




Start 


2039100 


2040740 


2041240 


2043633 


2045875 


2048203 


2049659 


2050681 


2054134 


2055260 


2056876. 


2058641. 


2068722 


. 2069223 


2070967 


2071962 


2073765 


2073780 


2076802 


2077196 


2081808 


2082514 


2083582 


2085624 


2086942 


50 




Code 


ORF3327 


ORF3328. 


ORF3330 


ORF3331 


ORF3335 


ORF3339 


ORF3341 


ORF3343 


ORF3347 


ORF3351 


ORF3354 


ORF3355 


ORF3368 


ORF3372 


ORF3373 


ORF3374 


ORF3376 


ORF3378 


ORF3381 


ORF3383 


ORF3392 


ORF3393 


ORF3395 ■ 


ORF3397 


ORF3400 
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Seq.lD 


SEQ. ID. No.982 


SEQ.ID. No.983 


. SEQ. ID. No.984 


SEQ.ID, No.985 


SEQ. ID. No.986 


SEQ. ID. No.987 
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CO 

cn . 
d 
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9 
d 

Ul 

CO 


SEQ.ID. No.989 


SEQ. ID. No.990 


SEQ. ID. No.991 


SEQ.ID. No.992 . 


SEQ. ID. No.993 


SEQ.ID. No.994 


SEQ. ID. No.995 


SEQ. ID. No.996 


SEQ. ID. No.997 


SEQ. ID. No.998 


SEQ.ID. No.999 


SEQ. ID. No.1000 


SEQ. ID. No.1001 


SEQ.ID. No.1002 


SEQ. ID. No.1003 


SEQ. ID.No.1004 


SEQ. ID. No.1005 


SEQ. ID. No.1006 
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Best Blast Hit ID 
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PIR:RDECLA 


PIR:T35137 


TREMBL:PPRRMP_7 


TREMBLNEW:SCD10_29 


PIR:T36334 


PIR:T36332 


PIR:T36331 


PIR:T36330 


PIR:T36329 


PIRT36324 


PIR:T36327 


PIRNEW.A82248 


PIR;D70857 


PIR:S26208. 


PIR:T35675 


PIR:T35673 


TREMBL:SC4A7_32 


TREMBLAF160811J 


TREMBL:AF160811_2 


TREMBL:OOE250422_3' 


TREMBL:CBFMS 1 


PIR:JC1137. 


PIR:H70582 ■ 


20 
25 
30 
35 


Table I (continued) 


Function 


UDP-glucose 4-epimerase 


cyclopropane-fatty-acyl-phosphollpld synthase 


lactaldehyde reductase 


Unknown 


glycosyltransferase 


ABC transporter, ATP-binding protein 


Unknown 


Unknown 


hypoxanthine phosphorlbosyltransferase 


cell division protein ftsH2 


. GTP cyclohydrolase I 


Dihydropteroate synthase - Streptomyces 


hydroxymethyldlhydropterldine 


acyl-CoA thloesterase II 


ABC transporter ATP-bindlng protein 


glucose kinase 


ABC-transport system ATP binding protein 


transcription repressor 


multiple sugar-binding protein 


L-arablnose transport ATP binding protein 


L-arablnose membrane permease 


permease/. . 


Polypeptide deformylase 


xylose isomerase 


transposase 
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Stop 


2088046 


2068993 


2091882 


2094621 


2095844 


2096895 


2100078 


2101334 


2101884 


2103971 


. 21 04665 ■ 


2105603 


2107126 


2107934 


2109145 


2112601 


21.12632 


2113536 


2116108 


2117762 


2118979 


2120491 


2120918 


.2124867 


2126410 
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Start 


2087036 


2090294 


2090734 


2092840 


2094624 


2095942 


2098591 

; . 


2100171 


2101324 


2101884 


21 04069 


2104731 


2105717 


2108830 


2110821 


2111654 


2113495. 


2114702 


2114954 


2116212 


2117765 


2120072 


2120502 


2123521 


2125136 
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Code 


ORF3403 


ORF3408 


ORF3410 


ORF3413 


ORF3419 


ORF3420 


ORF3423 


ORF3426 


ORF3428 


ORF3429 
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ORF3441 
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.ID. No.1007 


ID. No.1008 


ID. No.1009 


ID. No.1010 


ID. No.1011 


ID.No.1012 


ID. No. 101 3 


ID. No.1014 
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ID.No.1016 
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ID. No.1017 


ID.No.1018 


ID. No.1019 


ID. No. 1020 


ID. No. 1021 


ID. No.1022 


D. No. 1023 


ID. No.1024 


ID. No. 1025 


D. No. 1026 


D. No.1027 


D. Nb.1028 


D. No, 1029 


ID. No.1030 


ID. No. 1031. 
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SEQ 


SEQ 


SEQ 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ. 


SEQ.I 


. SEQ-. 


, SEQ. 


SEQ, 


SEQ, 


SEQ. 


SEQ, 


SEQ; 


SEQ. 
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[0028] The ORFs corresponding to SEQ. ID. NO. 2 to SEQ. ID. NO. 1098 nucleotide sequences are defined in table 
1 , supra, and are repesented by their position in sequence SEQ. ID. No. 1 . For example, the ORF3 sequence is defined 
by the nucleotide sequence between the nucleotides at position 785 and 2455 on the sequence SEQI.No.1, ends 
included. 

[0029] The open reading frames have been identified via homology analyses as well as via analyses of potential 
ORF start sites. It is to be understood that each identified ORF of the invention comprises a nucleotide sequence that 
spans the contiguous nucleotide sequence from the codon immediately 3' to the stop, codon of the preceding ORF and 
through the 5' codon to the next stop codon of SEQ. ID. No. 1 inframe to the ORF nucleotide sequence, 
[0030] Table 1 also depicts the results of homology searches that compared the sequences of the polypeptides 
encoded by each of the ORFs to sequences present in public published databases. It is understood that in one em- 
bodiment, those polypeptides listed in Table 1 as exhibiting greater than about 99 % identity to a polypeptide present 
in a publicly disclosed database are not considered part of the present invention. Likewise in this embodiment, those 
nucleotide sequences encoding such polypeptides are not considered part of the invention. 

[0031] As regards the homology with the ORF nucleotide sequences, the homologous sequences exhibiting a per- 
centage identity with the bases of one of the ORF nucleotide sequences of at least 80%, preferably 90% and 95%, are 
preferred. Such homologous sequences are identified routinely via, for example; the algorithms described above and 
In the examples below. The said homologous sequences correspond to the homologous sequences as defined above 
and may comprise, for example, the sequences corresponding to the ORF sequences of a bacterium belonging to the 
Bifidobacterium family. 

[0032] These homologous sequences may likewise correspond to variations linked to mutations within the same 
species or between species and may correspond in particular to truncations, substitutions, deletions and/or additions 
of at least one nucleotide. The said homologous sequences may also correspond to variations linked to the degeneracy 
of the genetic code or to a bias in the genetic code which is specific to the family, to the species or to the variant and 
which are likely to be present in Bifidobacterium. 

[0033] According to a preferred embodimentthe polynucleotide is a nucleotide sequence, which encodes the follow- 
ing polypeptides or fragments thereof: 



(a) SEQ. ID. No.S (ORF13) 



MPTGRVRWFDAAKGYGFITSEEGKDVFLPAQALPTGVTTLRKGAKVEYSV 
VDGRRGPQAMDVRLIASAPSLVKATRPKADDMAAICEDLIKMLDAAGNTL 
RRHRYPSAADS KKLATLLRAVADQFDVQD 
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(b) SEQ. ID. No.525 (ORF1827) 

MTTAAAQAPAPGKLE FKDD YT PDEAERVT RNS KGL P VGVR PKMVWTWKKA 
LLWAAIAIVCACGWAILAVSRGEQISAIWFLVVALSSYAIAYRFYAYYIQ 
IKIMRTDDANATPAERVHDGANFERTDRRVLFGQHFAGISGAGPLVGPIL 
AAQMGYL PSTLWI I LC^IFAGAVQDMLVLWISAKRRGRSLGQMATDEMGK 
FGGMILS I FLVVMTA I AMAFIiALVAI KAMAA S PWAVFS I GMT I P I AL I MG 
CYQRFLRPGRVIETTLLGFVLLVLDIVAGGWIASIPAVAAVFTLDAKQLV 
IALVI YSFAAAALPHWLLVTPRDY1.STLMKIGTLVLLVIGI I IANPSVKV 
PGLTELASTSTGPTFSGNLFPFLFiTIACGALSGFHGAVSSGLTPKAVEK 
ENQIRMIGYGSMLVESFTAVIALIAAITISQGVYFSTNMSAAQITAASGV 
S I SATSTPGEQADAAVKAVESMKVSDIEGNQMQVTWDSVDENGAAKT YEG 
AAALEQAAADIGETSIVSRTGGATTFAMGMANFLKSYLGGHDSMAFWYHF 
A I MFEAli F I IiTTVDNGTRVAR YQ I GEMLGNVRKLKKFADP T WKPGN 
1 1 TTLIATALWGGLIiWMGVSDANGGINAMVP I FGI SNQLiiAAACFV 
LITVCVAKMGYWKHL.WI PVVPLVWDI AVTFTADFQKI FGPLS YFTT 
20 AS KYQAQ I DSGELTGEALTNAKAALSNAYLDGVLS VFFLVMMGVFV 

WGIVWARTFAAGKYGAETTSEEPFVESQWFAPSSLVATALEKKV 
QREYSAKLHELVRNGQVAA 
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c) SEQ. ED. No.424 (ORF1473) 



MTQSRRMLVLRAWED Y I RSQEPVGSTSLTRDHDLGVS SAT I RNDMAALE 
DEGYLIQPHTSAGRVPTEKGYRYFVDRLATWPLSEAQRRGINSFLSGSV 
SLK13AIiQRSARLLSEITGQVAVVASPSLAiG^TLRHVEiyrVPVAMTTLLAW 
ITDTGRVAQHGLTIASMPAVDEINRLSNTVNEQCDGLSIiSKSAETVRSIA 
ASAGYESVRGVADTLADAFESMALDERA^LYMSGTSHLAHSRSLADLAP 
LFDALEEQVVLMKLMSNLSEETNASGVGVAIGSEMHTPGLIiHASVVSSGY 
GRSGAAGEPAGNDPVGEPETESETESQTNDTEPIAFVGSIGPTHMDYAAT 
MAAVRAVARYIiTAFL S EGRTQD 



d) SEQ. ID. No.548 (ORF1905) 



MYFKDGNDNAQRGGSTVRRSRQRRIMGRWSYNEDVPRCTFCGKTEHQVR 
KLVAGPNAS I CDEC IALCVDI I SEERVKDAEVNSLSLPKPAQ I FD YUJRY 
VIGQENAKRALSVAVYNHYKRVNMELQESAEQLDGNNGHSGQTSKQAKQS 
VPTQTRATRRSNDPIJ^VEVAKSNILLLGPTGVGKTYIAQAIARVMNVPF 
„ VITDATTLTEAGYVGDDVETVLQRLLQAADGDVSRAQHGIIYIDEIDKIA 
RKSGENTS I TRDVSGEGVQQALLKILEGTIAS VPLEGTRKHKEQDVAQMD 
TRG I LF I CGGAFVGLTD I VRKRLGRRETGFGANWHDADMKDEELLEQVNA 
DDLAEFGLLPEF IGRLPVTS VLKELTVDDLTAI LTQPANAL I KQYRKLFA 
VDGVDLQFTEQAIRAIADIAIKQGTGARGLRSI IERTLQDTMFQLPSLDD 
55 VRQVIVDKASVEGSSTPKLLREAVDVPQGRLKVAQSVLLDRVRTHEA 
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e) SEQ. ID. No.74 (ORP219) 



MSEQLMEQ YRLRGQRKCRNAC I AAI VTVVLVLAVAGGVWWTAGDGSALVR 
NMFKPKATPATQPVVNSTATFAYRTAPEFIAMEAGDRGTGNVNYSPASMW 
MALAIAAQGANGTTRSQLiNELLtGSGSLTDSDYQSLLS S INGQYSGAKSEM 
SAANSLWIDDDYSIASDYQSWKKMFEAEVTTLPFDDQAAAKMSDWIAKH 
TNGS LKPKI TLRDREVLS 1 1 NTVYADGR WKDPF EE Q S TGNGTFHG EAGDA 
QVPMMHQTFSQMAYGHDEYNTWQRVEIPFDNGGNLAIVLPAEGHFDELAG 
DAEKLSWAFGTCSTASLGEGAMGCAADSMPGWGVSVNSVMVNVTLiPRFTI 
DSMFDSEATIKAFEKLGVTDAFSAGDADFTKMIDTGSHGENLYIGSILQG 
TRI EVNEAGAKAMS FTKVGADS VSAPVDNVEFTVDRPFLYS YVTPDGI PL 
F I GAVRNLGGVGGEN 



f) SEQ. ID. No.576 (ORF1972) 

MQIRPGSMYPLGASYDGAGVNFALYSQVAQKVELCLFDEHDVETRIEMTE 
RNSYVWHNYIPGLHPGQRYGYRVYGPYDPVHGLRCNPNKLLLDPYAKAIE 
GNIDGDESLFSYWFKSPDDNSAMNDLDSAAHTMKSAVINPYFDWGNDQHP 
Y I S YHD S VI YEAHVRGMTNLNMDVPPDIRGTYAGLA YP S VI E YLKKLG I T 
AIELMPIHQFVNDSFLQEKGLSNYWGYNTIGFFAPHNAYSSSGERGEQVN 
E FKSMVKAYHRAGMEVI LD WYNHTAEGNHMGPTL S FKG I DNAS Y YRL VE 
GIXJQHYFDTTGTGNSLLMRSPHALQLITOSLRYWVTEMHVDGFRFDLAAT 
LARQFQEVDKLSAFFDIVEQDPIISRVKLIAEPWDLGSGGYQVGGFPSSW 
S EWNGRYRDT VRDFWRS QPiSTLPEFASRLMGSSDL YQVNGRR P VAS VNF I 



TAHDGFTMNDLVSYNEKHNEANGEGNRDGESNNRSWNGGVEGPTNIPDVN 
DLRQRQMRNMFATLLFSQGIPMICGGDEVARTQQGNNNAYCQDNEISWTN 
WHIJD KGRKELLAFVS KL I HLRLDHP VLHRRRFFTGRE PGDDSNT I PQVEW 
FDHTGS IMDMDDWQNTHAFSMMI YLNGSDI PEVDWYGNRMVDNDF I LI FN 

AHYEPIMFTLPDERYGRKWQLWDTHNPNEPALSYEAGFMITAQSRSFLM 
LMSAKKPKKPMGL 



g) SEQ. ID. No.4Q3 (ORF1403) 

MEIRARPHMASGSYRELFSARMMYGMQYIQQTIVGIDGSEARFFGYVADN 
SEEMEPDRIRPAILILPGGGYAMTSDREAEPVALQFLAKGFAVFVIjRYSV 
QPSRYPVALLFJVAEAMRLIRANVDQWHVNPAQVAVLGFSAGGHLAANLAT 
SVGDEDIREO^GMDPDAVRPNALMLSYPVITAGKYAHRGSFQCLLGDQAH 
NQALIjDKFS I ekh idaktp P VFVWHTMTDDAVPVENTLML I qacraagvs 
I EAHLFPEGSHGLSIJVNAETAGNGFYAHIVECVQCWPDIjAEAWLRRIjF 
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(h) SEQ. ID. No.804 (ORF2676) 

MFLKPEQQLERCRRIVRQRVDPHIHPSIAQLWESYDIPGEPMPSDEFFA 
KLDRGDI DFKP FMLGS EWGTTWGTVWFRLTGTVPAGYPKGKPLEL I LDLG 
WYPHSCGGH I EGLVYRADGTAI KAVHPLNYWVP FMDAEGNAQVPVAEDGS 
FTLYLEAASNPLLLGVPPF I ETELGDHATGKPDE PYVFKSADLAEFDERY 
ENYS VDLDWS S LMEFADKQS PRYWQLAKALQRS LNAYDERNPES VEAAR 
AVTAGVLAKPANASAMNVSAI GHAH IDSAWLWP VRETRRKVARTVSNALA 
LMDADPDFKYAMSSAQQYAWLEEDHPDI FKRMKRRI EEGRF I PVGGMWVE 
ADGMLPAGES L I RQ I AYGRKYFKEHLGVE PKGVWLPDS FGYTGAWPQ I AR 
RAGYE WFLTQK I S WNDTTKFPHHS FMWEG I DGSR I FTHFP PADTYAAWCK 
VQEU3YAEKNFQDKDLSDRSLLLFGFGDGGGGPTRNN1MEHLHRYENLEGV 
S KVS I EEPNDFFDKAHQQLAENAGPEMPVWKGEL YLELHRGTLTSQQDMK 
RGCRQEESLLRlVEYLGAAAVLSDPEYVYPREELDRIWKTLLiLNQFHDIL 
PGSAIAWVHREAREDYRRDLICRIiAEIAQDMCAvijRKANPQADLLAEARIS 
QFRNDGAS WHANR I NE PTDALS VLTOTLDNGRVLLANGVL S VT I EADGTI 
SSLLDEEHGRELVPAGTRLGQYELIiRDEPAVWDAWEIERESLLMANAVTG 
S I ESVNTENGAAQVHVHTADGDTVI TTTI TLRPGSHTLDFHAD IDWHERE 
RFLKVDIiPLGIVADQATYDCQYGLIRRPIVKNTASDEAKYESSTNRFAII 
GDAGYAAAVINGSVYGSDASPIAGNAAEGRDSGTMFRLSLLSAPTFPDPR 
TD I G SHE FDWS yVADATVDRALDAAGVLN A PVLHD VPD I T PLAS I E S VNG 
TVVLDWMKliADDGSGDLIVRAYEAAGGQADAMLHVCPAIAGASVHETNVLi 
EGDDLAADLPVALiQDGRQNAEGATLHFGPFQLATLRITR 



(i) SEQ. ID. No.313 (ORF1077) 

MISRDGWAVIDDSAANIIIETDTVNGKANPFGTWVSPRATAETDL.YFFGY 
GHR Y I EAVRDF YRLTG PTPLL PR FAMGNWWS R Y YRYTQDG YLALMDRFKR 
EGI PFTTSVIDMDWHRVDDVDPKYGSGWTGYSWNRELFPDPPAFLADLHR 
RGLRTTLNVHPRDGVRAFEDAYPEVAKRVG I DPATEENVE FDLTNPDFVD 
AY FDMHHRMEAEG VD F WVHIiPWQQGG VTRQKGLD PL WMLNHMH YLDS GRGG 



NWPLTFSRYAGPGSHRYPVGFSGDTIVTWESLiAFQPQFTATASNIGYGWW 
SHDIGGHMFGYRNEELEARWYQLGAFSPINRLHSSNSPFSGKEPWNFNRD 
VSAAMVDALRLRHAMMPYLYTMNYRAAEAGRPLVEPMYWQNPDTPDAYEV 
PDEFRFGTELWAPIVSPDDAAACRGRADAWLPQGEWFDFFDGRRYVSSD 
AAGRRLEWRSLDRTPVFAKAGAIVPLQDVAESGETViNSIANPQALRVIiV 
FPGAI^SFVMREDRGTWGAPSADTAIAFTWGGADASPSAFTVAPVTGDTS 
AVPELRDWTWFRGVAPVDAASGVRA^SGEAPVEATVAYDEATMSLTVSV 

tgissaasijrieipgglriadnpvesdAmdlllhaqmlyrtkeialqavh 

KLG I GAl GALRTMNRGPRYANDFWI TDMPDAVAGAIiEE I LLRS 

[0034] According to another preferred embodiment the present invention comprises a polynucleotide encoding a 
fusion protein, comprising any one of SEQ. ID. NO. 2 to SEQ. ID. NO. 1 098 ligated in frame to a polynucleotide encoding 
a heterologous polypeptide. The skilled person is well aquainted with techniques performing such a ligation and ex- 
pressing the corresponding fusion-polypeptide in an appropriate cell. * 
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[0035] The present invention also relates to recombinant vectors for the cloning and/orthe expression of a nucleotide 
sequence according to the present invention; The vectors comprise elements necessary to enable expression and/or 
secretion of the nucleotide sequences in a given host cell, such as a promoter, signals for initiation and for termination 
of translation, as well as appropriate regions for regulation of transcription. For example, expression of a protein or 
s peptide may be controlled by any promoter/enhancer element known In the art. Exemplary promotors are the CMV 
promoter, the SV40 early promoter region, the promoter contained in the 3' long terminal repeat of the ro.us sarcoma 
virus, the heroes thymidine kinase promoter, the regulatory sequences of the metallothionein gene, or, for prokaryotic 
expression systems, the p-lactamase promoter, the tac promoter or the T7 promoter. 

[0Q36] The vector should be capable of being stably maintained in the host cell and may optionally possess particular 
io signals specifying the secretion of the translated protein. These different elements are chosen according to the host 
cell utilized. To this effect the nucleotide sequences according to the invention may be inserted into autonomously- 
replicating vectors within the chosen host, or integrative vectors in the chosen host, such as e.g yeast artificial chro- 
mosomes, plasmids or viral vectors. It will be appreciated that the vector may well be the plasmid accordig to SEQ. 
ID. No 1099 or a recombinant form thereof, which has been supplemented by particular ori's that enable a high copy 
* s number. 

[0037] Any of the standard methods known to those skilled in the art for inserting DNA fragments into a vector may 
be used to construct expression vectors containing a chimeric gene consisting of appropriate transcriptional/transla- 
tional control signals and the protein coding sequences. These methods may include in vitro recombinant DNA and 
synthetic techniques and in vivo recombinants (genetic recombination). 

[0038] The vector may be used for transcripton and/or translation of a nucleic acid comprised by SEQ. ID. NO. 1 to 
produce RNA orantisense RNA, respectively. Such a vectorcan remain episomal or become chromosomaily integrated, 
as long as it can be transcribed to produce the desired transcript. 

[0039] The antisense nucleic acids of the invention comprise a sequence complementary to at least a portion of a 
RNA transcript of a polynucleotide sequence In SEQ. ID. No. 1 , designating a sequence having sufficient complemen- 
tarity to be able to hybridize with the RNA, forming a stable duplex. In the case of double-stranded antisense nucleic 
acid sequence, a single strand of the duplex DNA may thus be tested, or triplex formation may be assayed. 
[0040] Theinvention also encompasses host cells transformed with a nucleic acid oravectoraccordingfo the present 
invention and as described above. These cells may be obtained by introducing into an appropriate cell a nucleotide 
sequence or a vector as defined above, and then culturing the said cell under conditions allowing the replication and/ 
30 or the expression of the transformed/transfected nucleotide sequence. 

[0041 ] The host cell may be chosen from eukaryotic or prokaryotic system, such as for example bacterial cells, yeast 
cells, animal cells as well as plant cells. In the context of this invention a cell shall be understood to comprise higher 
biological systems. Such as animals, whole plants or parts thereof. Furthermore, a host cell strain may be chosen 
which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific 
3s fashion desired. 

[0042] A preferred host cell for the expression of the proteins of the invention consists of prokaryotic cells, such as 
gram negative or gram positive bacteria. A further preferred host cell according to the invention is a bacterium belonging 
to the Bifidobacterium family, more preferably belonging to the species Bifidobacterium longum or chosen from a mi- 
croorganism associated with the species Bifidobacterium longum. ' 

[0043] The transformedAransfected cells according to the invention may advantageously serve as a model and may 
be used in methods for studying, identifying and/or selecting compounds capable of being responsible for any of the 
beneficial effects brought about by the present Bifidobacterium strain. 

[0044] The invention further provides polypeptides encoded by the Bifidobacterium longum ORFs, in particularthose 
listed in table 1 and identified in the sequence listings. In the present description, the terms polypeptide, peptide and 
protein are used interchangeably. Furthermore the present invention also pertains to method for preparing such 
polypeptides by recombinant means comprising the steps of (a) culturing a host cell according to the present invention 

underconditionssuitabletoproducethepolypeptideencodedbythepolynucleotide;and(b)recoveringthepolypeptide 
from the culture, 

[0045] It will be appreciated that theabove polypeptides may also be obtained using combinatory chemistry, wherein 
the polypeptide is modified at some locations before testing them in model systems, so as to select the compounds 
which are the most active or which exhibit the desired properties. 

[0046] In this context, chemical synthesis has the advantage of being able to use non-natural amino acids or non- 
peptide bonds. Accordingly, in order to e.g. extend the life of the polypeptides according to the invention, it may be 
advantageous to use such non-natural amino acids, for example in the D form, or alternatively amino acid analogues 
preferably sulphur-containing forms. 

[0047] Finally, the structure of the polypeptides according to the invention, its homologous or modified forms, as well 
as the corresponding fragments may be integrated into chemical structures of the polypeptide type and the like. Ac- 
cordingly, in order to preserve the polypeptide in an in vivo environment it will be preferred to provide at the N- and C- 
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terminal ends compounds which convey a resistance to degradation to; proteases. 

[0048] It will ajso be appreciated that the different polypeptides according to the present invention and produced by 
the above method may represent antigens to. the immune system of a host animal, so that antibodies may be produced 
directed against said polypeptides. These antibodies may be used for the detection of a polypeptide of interest in a 
3 mixture or genetically of a strain of Bifidobacterium in a sample. In addition they may be used as research tools by e. 
g. producing antibodies against cellular surface epitopes and determining the effect of blocking certains polypeptides 
on the bacterial cell wall. Therefore, according to another aspect, the invention provides anlibodies directed to epitopes 
on the various. polypeptides provided by this invention. 

[0049] According to another aspect the present invention also provides a method forthe detection and/or identification 

10 of Bifidobacterium longum in a biological sample. This method may comprise several techniques known in the art, such 
as PCR or simply hybridisation with a suitable probe. Alternatively, an antibody raised against a cell wall epitope of 
Bifidobacterium longum may be used for said purpose. It will be appreciated that the above method may also be 
reversed and the presence of antibodies against Bifidobacterium may be determined by contacting the sample to be 
tested with a polypeptide of Bifidobacterium under conditions to allow formation of immune complexes. 

15 [0050] The polypeptides according to the invention, the antibodies according to the invention described below and 
the nucleotide sequences according to the invention may be used in in vitro and/or in vivo methods for the detection 
and/or the identification of bacteria belonging to the species Bifidobacterium in a biological sample (biological tissue 
or fluid) which is likely to contain them. These methods, depending on the specif icity of the polypeptides, of the anti- 
bodies and of the nucleotide sequences according to the. invention which will be used, may detect and/or identify the 

20 bacterial variants belonging to the species Bifidobacterium as well as associated microorganisms capable of being 
detected by the polypeptides, the antibodies and the nucleotide sequences according to the invention which will be 
chosen. It may, for example, be advantageous to choose a polypeptide, an antibody or a nucleotide sequence according 
to the invention, which is capable of detecting any bacterium of the Bifidobacterium family by choosing a polypeptide, 
an antibody and/or a nucleotide sequence according to the invention which Is specific to the family. 

2s [0051] All the sequences referred to herein (SEQ ID. NO. 1 to SEQ ID: NO. 1099) are listed in the attached sequence 
listings which is to be considered as part of the specification. . 

[0052] the invention also comprises the nucleotide sequences or polypeptides according to the invention covalently 
or noncovalently immobilized on a solid support. In the first case such a support may serve to capture, through specific 
hybridization, the target nucleic acid obtained from a biological sample to be tested. If necessary, the solid support is 

so separated from the sample and the hybridization complex formed between the capture probe and the target nucleic 
acid is then. detected by means of a second probe, called detection probe, labelled with an easily detectable element. 
[0053] Such support may take the form of so-called DNA array or DNA chips, a multitude of molecular probes pre- 
cisely organized or arrayed on a solid support, which will allow sequencing genes, studies of mutations contained 
therein and the expression of genes, and which are currently of interest given their very small size and their high 

35 capacity in terms of number of analyses. 

[0054] The function of these arrays/chips is based on molecular probes, mainly oligonucleotides which are attached 
to a carrier having a size of generally a few square centimetres or more as desired. For an analysis the carrier (DNA 
array/chip) is coated with probes that are arranged at a predetermined location of the carrier. A sample containing 
fragments of a target nucleic acid to be analysed, for example DNA or RNA or cDNA, that has been labelled beforehand, 

40 is subsequently contacted with the DNA array/chip leading to the formation, through hybridization, of a duplex. After 
a washing step, analysis of the surface of the.chip allows the effective hybridizations to be located by means of the 
signals emitted by the labels tagging the target. A hybridization fingerprint results from this analysis which, by appro- 
priate computer processing, allows to retrieve information such as the expression of genes, the presence of specific 
fragments in the sample, the determination of sequences and the presence of mutations. 

45 [0055] The hybridization between the probes of the invention, deposited. or synthesized in situ on the DNA chips, 
and the sample to be analysed, may, e.g. be determined by means of fluorescence, radioactivity or by electronic de- 
tection. 

[0056] The nucleotide sequences according to the invention may be used in DNA arrays/chips to carry out analyses 
of the expression of the Bifidobacterium genes. This analysis is based on DNA arrays/chips on which probes, chosen 

so for their specificity to characterize a given gene, are present. The target sequences to be analysed are labelled before 
being hybridized onto the chip. After washing the labelled compounds are detected and quantified, with the hybridiza- 
tions being earned out at least in duplicate. Comparative analyses of the signal intensities obtained with respect to the 
same probe for different samples and/or for different probes with the same sample, determine a differential transcription 
of RNA derived from the sample. . 

55 [0057] The DNA arrays/chips according to the present invention may also contain nucleotide probes specific for other 
microorganisms, which will enable a serial testing allowing rapid identification of the presence of a microorganism in 
a sample. 

[0058] The principle of the DNA chip, as detailed above may also be used to produce protein chips on which the 
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support has been coated with a polypeptide or an antibody according to the invention, or arrays thereof, in place of 
the DNA. These protein chips make it possible to analyse the biomolecular interactions (BIA) induced by the affinity 
capture of target analytes onto a support coated e.g. with proteins, by surface plasma resonance (SPR). the polypep- 
tides or antibodies according to the invention, capable of specifically binding antibodies or polypeptides derived from 
the sample to be analysed, may thus be used in protein chips for the detection and/or the identification of proteins In 
samples. 

[0059] The present invention also relates to a computer readable medium having recorded thereon one or more 
nucleotide and/or a polypeptide sequences according to the invention. This. medium may also comprise additional 
information extracted from the present invention, such as e.g. analogies with already known sequences and/or infor- 
mation relating to the nucleotide and/or polypeptide sequences of other microorganisms so as to facilitate the com- 
parative analysis and the exploitation of the results obtained. Preferred media are e.g. magnetic, optical, electrical and 
hybrid media such as, for example, floppy disks, CD-ROMs or recording cassettes: 

[0060] The invention also relates to kits or sets forthe detection and/or the identification of bacteria belongingto the 
species Bifidobacterium longum or to associated microorganisms, which comprises, a polypeptide according to the 
invention, where appropriate, the reagents for constituting the medium appropriate for the immunological or specific 
reaction, the reagents allowing the detection of the antigen-antibody complexes produced by the immunological reac- 
tion between the polypeptide (s) of the invention and the antibodies which may be present in the biological sample, it 
being possible forthese reagents also to carry a label, or to be capable of being recognized in turn by a labelled reagent, 
more particularly in the case where the polypeptide according to the invention is not labelled.a reference biological 
sample (negative control) free of antibodies recognized by a polypeptide according to the invention, a reference bio- 
logical sample (positive control) containing a predetermined quantity of antibodies recognized by a polypeptide ac- 
cording to the invention. 

[0061] The invention also relates to a kit or set for the detection and/or the identification of bacteria belongingto the 
species Bifidobacterium longum or to an associated microorganism, or for the detection and/or the identification of a 
microorganism, wherein the kit comprises a protein chip according to the invention. 

[0062] The present invention also pertains to a novel microorganism termed NCC2705, which has been deposited 
according to the Budapest Treaty with the Institute Pasteur on January 29 th , 2001 and received the deposit ho. CNCM 
1-2618. This micro-organism belongs to the genus Bifidobacterium, species Bifidobacterium longum and is a probiotc 
micro-organism, i.e. it may pass the gastro-intestinal tract in an essentially live and viable form and has the capability 
of preventing colonization of the intestine with pathogenic bacteria causing diarrhea and in addition may prevent or 
reduce the occurence of infection of Intestinal cells by rotaviruses. 

[0063] The microorganism is gram positive, catalase negative and C0 2 production negative, it produces L(+) lactic 
acid and essentially prevents colonization of intestinal cells by bacteria bringing about diarrhea, such as pathogenic 
E. coll, e.g. enteropathogenlc E.coli (EPEC), or salmonella, e.g. Salmonella typhlmurium and prevents Infection of 
intestinal cells by rotaviruses. 

[0084] The novel microorganism may be used for the preparation of a variety of carrier materials, such as e.g. milk, 
yogurt, curd, fermented milks, milk based fermented products, fermented cereal based products, milk based powders, 
infant formulae and may be included in the support in an amount of from about 10 s cfu / g to about 10 11 cfu / g. For 
the purpose of the present invention the abbreviation cfu shall designate a "colony forming unit" that, is defined as 
number of bacterial cells as revealed by microbiological counts on agar plates. 

[0065] The present invention also provides a food or a pharmaceutical composition containing at least the Bifido- 
bacterium NCC 2705 and/or containing a supernatant, in which the microorganisms have been grown or an active 
fraction/metabolite thereof, respectively. 

[0066] For preparing a food composition according to the present invention at least one of the Bifidobacteria of the 
present invention is incorporated in a suitable support, in an amount of from about 10 5 cfu / g to about 1 O 12 cfu / g, 
preferably from about 10 s cfu /g to about 10^° cfu / g, more preferably from about 10 7 cfu/g to about 1 0 9 cfu / g. 
[0067] In case of a pharmaceutical preparation the product may be prepared in form of tablets, liquid bacterial sus- 
pensions, dried oral supplements, wet oral supplements, dry tube feeding or a wet tube feeding with the amount of the 
Bifidobacterium/Bifidobacteria to be incorporated therein.being in the range of up to about 1 b 12 cfu /.g, preferably from 
about 10 7 cfu / g to about 10 11 cfu / g, more preferably from about 10 7 cfu./ g to about 10 10 cfu / g. 
[0068] The activity of the novel microorganism in. the Individual's intestine is of course dose dependent. That is, the 
more the novel microorganism or an active component thereof is incorporated by means of ingesting the above food 
material or the pharmaceutical composition the higher the protective and/or curing activity. Since the novel microor- 
ganism is not detrimental to mankind and animals and has eventually been isolated from baby feces a high amount 
thereof may be incorporated so that essentially a high proportion of the individual's intestine will be colonized by the 
novel microorganisms. 

[0069] Yet, according to another preferred embodiment the supernatant of a culture of the Bifidobacterium of the 
present invention, or an active fraction thereof, may be used for preparing the carrier. The supernatant, may be used 
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as such or may be dried under conditions that do not destroy the metabolic compounds secreted by the microrganisms 
into the liquid medium, such as e.g: freeze drying, and may be included in the carrier. In order to minimize the number 
of unknown compounds In the supernatant the Bifidobacteria will preferably be grown in a defined media, the compo- 
sition of which is known and does not negatively affect the host incorporating it. Further, the skilled person will, based 
s on his general knowledge optionally deplete the supernatant from unwanted products, such as e.g. by means of chro- 
matography. 

[0070] The present inventors have investigated baby feces and isolated a variety of different bacterial strains there- 
from. These strains were subsequently examined for their capability to prevent prevent colonization and/or invasion 
of epithelial cells with bacteria that are. known to cause diarrhea, such as E.coli, Sigella, Klebsiella, Yersinia, Pseu- 

10 domonas aeruginosa Listeria, Streptococcus, Staphilococcus, Clostridium difficile, H. pyori and also Candida albicans. 
[0071] Several bacterial genera comprising Bifidobacterium, Lactococcus and Streptococcus were screened for their 
diarrhea inhibitory properties. The tests for the inhibitory property were performed with pathogenic microorganisms, 
such as E.coli, Klebsiella, Yersinia, Pseudomonas aeruginosa, H. pyori, and Salmonella typhimurium as representa- 
tives for pathogenic micro-organisms causing diarrhea in affected individuals. 

15 [0072] The various bacteria were grown in a suitable medium, such as MRS, Hugo-Jago or M 1 7 medium at temper- 
atures of from about 30 to 40°C corresponding to their optimal growth temperature. After reaching stationary growth 
the bacteria were collected by centrifugation and resuspended in physiological NaCI solution. Between the different 
tests the bacterial cells were stored frozen (-20°C). 

[0073] For assessing anti-bacterial properties the following approaches were chosen. 
20 [0074] According to one protocol the cultured Bifidobacterium of the present invention was examined for its capability 
to decrease the viability of the different pathogenic micro-organisms. To this end, a culture of pathogenic bacteria was 
contacted with a concentrated supernatant of a Bifidobacterium culture and the growth potential of the pathogenic 
bacteria was assessed. 

[0075] According to a second protocol the adhesion capability of the Bifidobacteria of the present invention to T M 
zs cells, a cell culture model for the intestine, was determined by culturing the Bifidobacterium with cells and the rate 
of adhesion was assessed. 

[0076] According to another protocol the potential of the Bifidobacterium of the present Invention to prevent infection 
of intestinal cells by Salmonella, using the cell line Caco-2 as a model for the intestine, was determined. In this respect, 
the supernatant of a cell culture of the Bifidobacteria of the present invention was added together with the pathogenic 
30 microorganism to the intestinal cells and the rate of adhesion, or invasion, respectively, was assessed. 

[0077] Thus, it could beshown thatthe cultured Bifidobacterium and the supernatant proofed to be extremely effective 
in preventing both adhesion to and invasion into the intestinal cells Indicating that one or more metabolic compounds 
secreted by the microorganism is/are likely to be responsible for the anti-diarrhea activity. 

[0078] According to yet another protocol it was further assessed, whether NCC2705 would be capable to prevent 
3S invasion of epithelial cells by rotaviruses. Two different protocols were applied. According to one protocol the various 
bacterial strains were examined for their direct interaction with the rotavirus strain while in the second protocol the 
bacteria were screened for those strains that interact with cellular rotavirus receptors. 

[0079] The first protocol involved contacting the respective bacterial suspension each with a different rotavirus strain 
and incubating in suitable media. Subsequently, the virus-bacteria mixture was applied to a monolayer of cells of the 
<H3 human undifferentiated colon adenoma cells HT-29 (intestinal epithelial cell line) and incubation was continued. Virus 
replication was then assayed. 

[0080] The second protocol involved incubating the respective bacterial suspension first together with a monolayer 
of cells, of the human undifferentiated.colon adenoma cells HT-29 and adding the virus subsequently. After continued 
incubation . virus replication Was assayed. 
•« [0081] Rotavirus replication was assessed by histo-immunological staining of rotavirus proteins in infected cells. A 
rotavirus inhibitory effect was attributed to a given bacterium when the number of infected cells was. reduced by 90% 
in the cell culture inoculated with rotavirus plus the indicated bacteria in comparison with cells inoculated only with 
rotavirus. 

[0082] The present.invention will now be described by way of examples without limiting the same thereto, 
so [0083] Media and solutions: 

MRS (Difco) 

Hugo-Jago (tryptone 30g / 1 (Difco), yeast extract 1 0 g / 1 (Difco), lactose 5 g / 1 
(Difco), KH 2 P0 4 6 g / 1 , beef extract 2 g / 1 (Difco), agar 2 g / 1 (Difco)) 
ss M17 (Difco) 

Eugon Tomato Agar (Canned tomato juice 400 ml, Eugon agar BBL 45.5 g. Maltose Difco 1 0 g, Hemln Sigma 5mg, 
Agar Difco 5 g, distilled water 600 ml) 
DMEM (Dulbecco's modified Eagle medium) 
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CFA (according to Ghosh et al. Journal of Clinical Microbiology, 1993 31_ 2163-6) 
Muller Hinton agar (Oxoid) 

LB (Luria Bertami, Maniatis, A Laboratory Handbook, Cold Spring Harbor, 1992) 
C 14 -acetate (53,4 Ci/rnMol, Amersham International PLC) 
PBS (NaCI 8g/l, KCI 0.2 g/l, Na 2 HP0 4 1 .15 g/l, KHgPC^ 0.2 g/l)) 
Trypsin-EDTA solution (Seromed) 
FCS Fetal calf serum (Gibco) 

[0084] E. coli DAEC C 1845 was obtained from Washington University, Seattle and E. coli JPN15 was obtained from 
the Center for Vaccine Development of the University of Maryland, USA). The Salmonella typhimurium strain SL1344 
was obtained from the department of Microbiology, Stanford University, CA, USA. This strain acts as a pathogen on 
mice and is resistant to Streptomycin. It adheres to Caco-2 colon cells (Finlay and Falkow, 1 990). The Klebsiella was 
obtained from stock clinical isolates from the microbiological laboratory of the Facultede Pharmacte Paris XI, Chatenay- 
Malabty, France. The Yersinia was obtained from INSERM Unit 411 , Hopital Necker, Paris, France. The Pseudomonas 
aeruginosa was obtained from stock clinical isolates from the microbiological laboratory of the Faculte de Pharmacia 
Paris XI, Chatenay-Malabry, France. 

[0085] The H: pylori was obtained from Institute of Microbiology, Lausanne University, Lausanne, Switzerland. 
[0086] Human rotavirus Wa (G1 serotype) and simian rotavirus SA-11 (G3 serotype) were obtained from P.A. Offit 
Children's Hospital of Philadelphia, U.S.A. The DS-lxRRV reassortant virus was obtained from A. Kapikian, NIH Bethes- 
da, U.S.A. The serotype 4 human rotavirus Hochl was obtained from P. Bachmann, University of Munich, Germany. 

Example 1 

Isolation of Bifidobacteria 

[0087] Fresh feces were harvested from diapers of 1 6 healthy babies 1 5 to 27 days old. 1 g of fresh feces was p laced 
under anaerobic conditions for transportation to the laboratory and microbiological analyses were run within 2 hours 
from sampling by serial dilutions in Ringer solution and plating on selective media. Eugon Tomato Agar (Canned tomato 
juice 400 ml, Eugon agar BBL 45.5 g, Maltose Dlfco 10 g. Hemin. Sigma 5mg, Agar Difco 5 g, distilled water 600 ml) 
incubated anaerobicaily at 37°C for 48 hours was used to isolate bifidobacteria. Colonies were randomly picked up 
and purified. Physiological and genetic characterisation was performed on the isolates. 



Example 2 
35 Cultivating cell lines 

Caco-2 cells: 



[0088] For the inhibition assays the cell line Caco-2 was utilized as a model of mature enterocytes of the small 
intestine; This cell line presents characteristic of intestinal cells such as e.g. polarization, expression of Intestinal en- 
zymes, production of particular structural polypeptides etc.. The cells were grown on different supports, namely on 
plastic dishes (25 cm 2 , Corning) for growth and propagation, on defatted and sterilized 6 well glass plates (22 x 22 
mm, Coming) forthe adhesion and the inhibition tests. Afterthe second day in curturethemedium(DMEM) was changed 
on a daily basis. Before use the medium was supplemented with 1 00 U/ml penicilline/streptomycine. 1 u.g/ml ampho- 
terine, 20 % FCS inactivated at 56 °C for 30 min and 1 % of a solution containing non-essential amino acids (1 0 mM) 
(Eurobio, Paris, France), Culturing was performed at 37 "C in an atmosphere comprising 90% air and 10% C0 2 . The 
cells were splitted every six days. The cells were detached from the walls of the well by treatment in PBS with 0.015 
% trypsine and 3 mM EDTA at pH 7.2. For neutralizing the effect of trypsine an equal volume of the culture medium 
containing FCS was added to the cell suspension obtained, the mixture was centrifuged (1 0 min at 1 000 rpm) and the 
pellet was again dissolved in culture medium. Living cells (not dyed with trypane blue) were counted. About 3.5 x 1 0 s 
living cells were transferred to a new culture bottle and about 1 .4 x 1 0* cells per well and cultivated until a confluent 
monolayer was obtained. 



[0089] For the adhesion assays the cell line T^ was utilized as a model of colon cells from the intestine; This cell 
line presents characteristics of intestinal cells such as e.g. polarisation, expression of intestinal enzymes, production 
of particular structural polypeptides etc.. T M cells were obtained from University of California, San Diego CA Cells 
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were grown in DMEM.(50%) and Ham's F12 (50%) supplemented with 2 mM glutamine, 50 mM HEPES, 1% non- 
essential amino acids and 1 0% inactivated (30 min, 56"C) fetal calf seaim (Boehringer, Mannheim, Germany) at 37°C 
in a 10% 00^90% air atmosphere. Cells were seeded at a concentration of 10 6 cells per cm 2 . Cells were used for 
adherence assays at late post-confluence, i.e., after 10 days. 

[0090] All strains except Bifidobacteria were kept at -80°C In their culture medium containing 1 5% glycerol. As the 
number of transfers into new media has an influence on the adhesion factors, the Salmonella strain was only transferred 
twice within a period of 24 hours,. the first transfer taking place when the strain was frozen. All cultures were raised 
aerobically. 

Bifidobacteria 

[0091] The bacterial strain (Bifidobacterium longum CNCM 1-261 8 (NCC2705) was stored at -20 °C in MRS medium 
containing 1 5 % glycerol. The strain was grown under anaerobic conditions in M RS and transferred twice to new media 
at intervals of 24 hours before use in the inhibition assays. For the assay a concentration of 2 x 1 0 9 cfu/ml was utilized. 
The supernatant was collected by centrifugation for 1 hour at 20.000 rpm and the supernatant obtained was subse- 
quently checked for the presence of bacteria. The strains of Bifidobacterium were cultivated anaerobically in MRS 
during 18 hours at 37°C. The cultures were then centrifuged (20 min. at 40c), the supernatant was collected, lyophilized, 
returned to the solution and then concentrated ten times (1 Ox). The pH of the supernatant was finally adjusted to 4.5. 

20 E. coli C 1845: 

[0092] The first passage afterthawing was effected on a CFA - Muller Hinton agar, which is suitable to effect expres- 
sion of adhesion factors by the bacterium. Before each experiment the bacterial cells were incubated at 37 "C with a 
transfer to a new medium being effected twice after 24 hours each. 

.25 

Klebsiella: 

[0093] Bacteria were grown overnight for 1 8 hrs at 37°C in Luria broth. 
30 Yersinia: 

[0094] Bacteria were grown overnight for 1 8 hrs at 37°C in Luria broth . 
Pseudomonas aeruginosa: ' 

35 

[0095] Bacteria were grown overnight for 1 8 hrs at 37°C in Luria broth. 
H. pylori: • 

40, [0096] Bacteria were grown on Brain-Heart Infusion (BHI)-agar plates containing 0.25% yeast extract (Difco Labo- 
ratories, Detroit, Ml), 10% horse eerum and 0.4% Campylobacter selective complement (Skirrow supplement, SR69 ; 
Oxoid Ltd, Basingstoke, England). 

Example 4 

45 

[0097] The Caco-2 and monolayers, prepared on glass coverslips which were placed in six-well Corning tissue 
culture plates (Coming Glass Works, Corning, NY), were washed twice with phosphate-buffered saline (PBS). Bifido- 
bacteria (1 ml, 4x10 s bacteria/ml in spent culture supernatant, treated-supernatant or fresh MRS broth) were added 
to 1 ml of the cell line culture mediurn.This suspension (2 ml) was added to each well of the tissue culture plate and 

so the plate incubated at 37 °C in 1 0% COj>/90% air. After 1 hour of incubation, the monolayers were washed five times 
with sterile PBS, fixed with methanol, stained with Gram stain, and examined microscopically. Each adherence assay 
was conducted in duplicate over three successive passages of intestinal cells. For each monolayer on a glass coverslip, 
the number of adherent bacteria was evaluated in 20 random microscopic areas. Adhesion was evaluated by two 
different technicians to eliminate bias. 

55 [0098] The results are shown in Fig. 1 from which it becomes obvious that NCQ2705 is capable to adhere to intestinal 
cells as compared to to the known cell line GG (WO 97/00078), Lai (EP 0 577 903) or another Bifido strain (BL28/Cal). 
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Example 5 

[0099] As candidates for pathogenic bacteria E.coli, Klebsiella, Yersinia, Pseudomonas aeruginosa and H. pyori 
were used. 

[0100] Based on a culture of NCC2705 kept In MRS medium for 18 hours, an exponentially growing culture was 
produced (3 hours at 37°C). 2 ml of this solution were removed and centrifuged for 5 min. at 5500 g, +4 °C. After 
collection of the supernatant the cell pellet was washed in sterile PBS. After centrifuging, the pellet was collected and 
2 ml of sterile PBS were added. The bacteria were counted and the suspension was adapted in such a way that between 
1 and 5 x 1 0 s bacteria / ml were produced. 

[0101] The assessment of the antimicrobial effect exerted by the Bifidobacteria of the present invention was carried 
out according to the Lehrer method described in Lehrer et al., J. Imiinol. Methods 1 37 (1 99 1 ), 1 67-1 73, which document 
is incorporated here by way of reference. The results thereof are shown in Fig. 2 and 3. 

[0102] From the above results it may be seen that the Bifidobacterium of the present invention may effectively inhibit 
growth of the various pathogenic bacteria. 

Example 6 

Inhibition assay for salmonella 

[0103] Salmonella are bacteria that invade epithelial cells and multiply therein. For determining the inhibitory activity 
of the Bifidobacteria of the present invention towards Salmonella typhimurium the strain SL1344 and following proce- 
dure was used. 

[0104] The pathogenic cells were cultivated in LB-medium. After the second passage to new medium the bacterial 
strains were marked with radioisotopes using C^-acetate at 10 uCi/ml in LB-medium. Incubation of the strains in this 
medium was performed for 1 8 hours at 37 °C. 

[01 OS] The bacterial suspension was subsequently subjected to centrifugation (1041 rpm, 15 min) so as to eliminate 
the remaining C 14 -acetate from the supernatant. The pellet was suspended and washed in PBS and the cells were 
suspended at a concentration of about 1 0* cells / ml in 1 % sterile mannose. Mannose is known to inhibit non specific 
adhesion. The bacterial solution was then adjusted to 2 x 10 8 cells/ml. 

[0106] The pathogen (1 ml; 2 x 10 s cells) and an aliquot of a supernatant (1 ml) of a Bifidobacterium culture are pre- 
incubated for 2 hours at 37°C. The suspension is subsequently centrifuged, the resulting supernatant is removed and 
the pellet is again suspended in 0.5 ml PBS. This pathogen solution (0,5 ml) is then brought in contact with human 
Intestine cells In culture. The culture was washed with sterile PBS twice and 0,5 ml adhesion medium (DMEM) was 
added. The cells are then incubated for 1 hour at 37°C under 10% CO z . 

[0107] After incubation the number of bacteria in the incubation medium and on/in the intestinal cells are counted. 
In order to determine the amount of cells adhering on or having invaded into the intestinal cells the following approaches 
have been chosen. 

[0108] For determining the number of adhering bacteria the medium was decanted and the cells were washed once 
with culture medium and once with sterile PBS. Subsequently, 1 ml of sterile H 2 d was added per compartment, to lyse 
the cells and to form a cell solution which was incubated for 1-2 hours at 37°C, after which successive dilutions were 
carried out. In order to count the number of adhering and invasive bacteria, the cell solution was centrifuged to. remove 
cell debris and the radioactivity was measured. 

[0109] According to another protocol 10 aliquots were each put on TSA medium. The media were then incubated 
for 18-24 hours at 37° C. . 

[0110] For determining the amount of invaded bacteria the Caco-2 cells were washed with PBS so as to eliminate 
all non-adhering cells. Subsequently, a medium containing gentamycin (20 ug/ml) was added and incubation was 
continued for 1 hour at 37 °C. Gentamycin is an antibiotic not penetrating intestinal cells so that all extracellular mi- 
croorganisms were killed, while bacteria having already invaded intestinal cells will survive, The cells were then incu- 
bated for another hour at 37°C and werethen washed twice with PBS. The cells were lysed by addition of and incubation 
in sterile distilled water for for 1 -2 hours at 37°C. After removing the cell debris radioactivity was determined. According 
to another protocol successive dilutions were carried out, which were put on TSA medium. Incubation' 18-24 hours at 
37°C. 

[01 1 1 ] It may be seen that cultured cells and the culture supernatant were extremely effective in preventing adhesion 
of and invasion into intestinal cells by Salmonella. 
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Example 7 

Infection of mice by the strain S. typhlmurlum C5 

s [0112] Adult, 7-8 weeks old, axenic, female mice (C3H/He/oujco conventional, Iffa Credo, France), raised under 
sterile conditions, were orally infected with a fixed concentration of S. typhimurium (0,2 ml, 1 0 8 cfu/mouse). Some mice 
were rendered monoxenic by the implantation of a range of Bifidobacteria strains. With some mice, the Bifidobacteria 
in segments of the intestine were counted after its removal and mincing of the organs in PBS. With other mice, the 
protection against infection was assessed in such a way that they were continuously kept in a sterile environment and 

to the days of survival were compared to the control group. 

[0113] The results are shown in Fig. 4. As may be derived therefrom in the control group nearly all mice died after 
a time period of about 1 0 days. In contrast thereto, all mice treated with NCC2705 were alive after 1 0 days with only 
20 % dying from the detrimental effect exerted by Salmonella after a period of 30 days. 

15 Example 8 

1 st protocol: 

[0114] 30 u.l of the respective bacterial suspension (containing on average 3x1 0 6 bacteria) were mixed with 70 u.l 
20 M1 99 medium supplemented with 1 0% tryptose phosphate broth (Flow) and 5% trypsin-EDTA solution (Seromed) to 
which were added 1 00 u.1 of virus in supplemented M1 99 medium. The virus-bacteria mixture thus obtained was incu- 
bated for: 1 hour at 4°C and for 1 hour at 37° C. Separately, cells of the human undifferentiated colon adenoma cells 
HT-29 growing as a confluent monolayer in96-well microtiter plates (in M1 99 medium supplemented with 10% tryptose 
phosphate broth (Flow) and 5% trypsin-EDTA solution (Seromed) 1 : 4 diluted with PBS) were washed three times with 
25 phosphate-buffered saline (PBS ; pH 7.2). The virus-bacteria mixture processed as indicated above was transferred 
to the cells and the microtiter plates were incubated for 18 h in a C0 2 incubator (Heraeus). Virus replication was assayed 
as described below. 

2"? protocol: ' 

30 

[0115] 30 nfof the bacterial suspension (supra) were mixed with 70 u.l M1 99 medium supplemented with 10%tryptose 
phosphate broth (Flow) and 5% trypsin-EDTA solution (Seromed ) and applied directly on HT-29 cells grown and pre- 
treated as described in the 1 s * protocol in the microtiter plates. After one hour Incubation at 37°C 100 uJ of virus in 
supplemented M 199 medium were added to the cells in the microtiter plates. The incubation was continued for 18 h 
35 in a CO a incubator (Heraeus). Virus replication was assayed as described below. 

[0116] The rotavirus replication was assessed by histo-immunological staining of rotavirus proteins in infected cells 
as described hereafter. 

[0117] One day after infection, the cell culture medium was removed from the microtiter plates and the cells were 
fixed with absolute ethanol for 10 min. Ethanol was discarded, and the plates were washed three times in PBS buffer. 

40 Then 50 u.l of an anti-rotavirus serum (mainly directed against VP6 protein), produced in rabbits (obtained from the 
ISREC University of Lausanne) and diluted 1 :2000 in PBS was added to each well and incubated for 1 h at 37°C with 
a cover slip to prevent desiccation of the wells. The anti-serum was discarded afterwards and the plates were washed 
three times with PBS. Then 50 u.1 of anti-rabbit immunoglobulin G (IgG) antiserum produced in goats and coupled to 
peroxidase (GAR-lgG-PO; Nordic) were added at a dilution of 1 : 500 in PBS to each well and the plates were incubated 

45 for 1 hour at 37 °C. The serum was discarded and the plates were again washed three times with PBS. Then 1 00 u.1 
of the following substrate mixture was added to each well: 1 0 ml of 0.05 M Tris-hydrochloride (pH-7.8), 1 ml of H 2 O a 
(30% suprapur, diluted 1 :600 in r-t,0 . Merck) and 200 utt of 3-amino-9-ethylcarbazole (0.1 g/1 0 ml of ethanol stored 
in 200 u.l aliquots at -80 °C ; A-5754 ; Sigma). The plates were incubated for at least 30 min at room temperature. The 
substrate was discarded andthe wells were filled with 200 uJ of H a O to stop the reaction. Infected cell foci were counted 

50 with an inverted microscope (Diavert; Leitz). 

[0118] Only very few bacterial strains interacted with rotaviruses. Merely 4 out of the 260. bacterial cells primarily 
selected inhibited rotavirus replication in at least one protocol. Bifidobacterium adolescentis CNCM 1-261 8 (NCC2705) 
showed an extremely high activity against Serotype 1 Rotavirus, Serotype 3 rotavirus SA-1 1 and Serotype 4 rotavirus 
Hochi. 

55 [0119] NCC2705 is gram positive and catalasenegative.it does not produce CO z during fermentation and produces 
just L (+) lactic acid according to methods disclosed in the "Genera of lactic acid bacteria", Ed. B.J.B. Wood and W.H. 
Holzapfel, Blackie A&P 

[0120] These results show the extreme superior properties of the Bifidobacterium of the present invention. 
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Claims 

1 . A polynucleotide having a nucleotide sequence of the Bifidobacterium genome, comprising 

(a) the nucleotide sequence of SEQ. ID. No. 1; 

(b) a nucleotide sequence exhibiting at least 90% identity with the sequence of SEQ: ID. No. 1 ; or 

(c) a nucleotide sequence that hybridizes to SEQ ID. No. 1 under stringent conditions. 

2. A polynucleotide having a nucleotide sequence selected from the group: 

(a) a nucleotide sequence as identifed by SEQ ID. NO. 1 099; or - 

(b) a nucleotide sequence exhibiting at least 95 % identity with SEQ. ID. NO. 1099; or 

(c) a polynucleotide which hybridizes to SEQ. ID. NO. 1099 under conditions of high stringency. 

3. A polynucleotide having a nucleotide sequence of an open reading frame (ORF) of a Bifidobacterium longum 
genome, comprising: 

(a) a nucleotide sequence chosen from any one of Seq ID. No. 2 to SEQ. ID. NO. 1098; or 

(b) a nucleotide sequence exhibiting at least 95 % identity with any one of SEQ ID NO 2 to SEQ ID NO 
1098; or 

(c) a polynucleotide which hybridizes to any one of SEQ. ID. N0.2toSEQ. ID. NO. 1098 under conditions of 
high stringency. 

4. The polynucleotide of any of the claims 1 or 2, which encodes the following polypeptides or fragments thereof: 



(a)SEQ.ID.No.5(ORF13) 

MPTGRVRWFDAAKGYGFITSEEGKDVFLPAQALPTGVTTLRKGAKVEYSV 
VDGRRGPQAMDVRL I ASAPSLVKATRPKADDMAAI CEDL I KMLDAAGNTL 
RRHRYP S AAD S KKLATLLRAVADQFDVQD 
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(b) SEQ. ID. No.525 (ORF1827) 

MTTAAAQAPAPGKLEFKDDYTPDEAERVIIWSKGLPVGVRPKMVWTWKKA 
LLWAAI A I VCACGWAI LAVSRGEQI SAI WFLWALS S YAI AYRFYAY YIQ 
IKIMRTDDANATPAERVHDGANFERTDRRVLFGQHFAGISGAGPLVGPIL 
AAQMGYLPSTLWI ILGVI FAGAVQDMLVLWI SAKRRGRSLGQMATDEMGK 
FGGMILS I FLVVMTAI AMAFLALVAI KAMAAS P WAVFS I GMT IPIALIMG 
CYQRFLRPGRVIETTLIiGFVLLVLDIVAGGWIASIPAVAAVFTLDAKQLV 
IALVI YSFAAAALPHWLLVTPRDYLiSTLMKIGTLVLLVIGI IIANPSVKV 
PGLTELASTSTGPTFSGNLFPFLFITIACGALSGFHGAVSSGLTPKAVEK 
ENQIRMIGYGSMLVESFTAVIALIAAITISQGVYFSTNMSAAQITAASGV 
S I SATSTPGEQADAAVKAVESMKVSDIEGNQMQVTWDSVDENGAAKTYEG 
AAALEQAAADIGETSIVSRTGGATTFAMGMANFLKSYLGGHDSMAFWYHF 
AIMFEALF I LTTVDNGTRVARYQ I GEMLGNVRKLKKFADPTWKPGNI I TT 
L I ATAL WGGLLWMGVSDANGG I NAMVP I FG I S NQL 1AAAC F VL I TVC VAK 
MGYWKHLWIPWPLVWDIAVTFTADFQKIFGPLSYFTTASKYQAQIDSGE 
LTGFjyL,TNAKA7a J SNAYLDGVIiSVFFLVMMGVFVVVGIVWARTFAAGKY 
GAETTSEEPFVESQWFAPSSLVATALEKKVQREYSAKLHELVRNGQVAA 

c) SEQ. ID. No.424 (ORF1473) 

MTQSRRMLVLRAWED Y I RS QE PVGSTS LTRDHDLGVS SAT I RNDMAALE 
DEGYLIQPHTSAGRVPTEKGYRYFVDRLATWPLSEAQRRGINSFLSGSV 
SLKDALQRSARLLS E I TGQVAVVAS PS IAKATLRHVEMVPVAMTTLLAVV 
I TDTGRVAQHGLTI ASMPAVDE I NRLSNTVNEQGDGLSLS KSAETVRS I A 
ASAGYESVRGVADTLADAFESMALDERANELYMSGTSHLAHSRSLADLAP 
LFDALEEQVVLMKIjMSNLSEETNASGVGVAIGSENIHTPGLLHASVVSSGY 
GRSGAAGEPAGNDPVGEPETESETESQTNDTEPIAFVGSIGPTHMDYAAT 
MAAVRAVARYLTAFLSEGRTQD 

d) SEQ. ID. No.548 (ORF1905) 

MYFKDGNDNAQRGGSTVRRSRQRRIMGRWSYNEDVPRCTFCGKTEHQVR 
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KLVAGPNAS ICDECIALCVDI I SEERVKDAE VNS LS LPKPAQ I FDYLNRY 
VIGQENAKRAIjSVAVYNHYKRVNMELQESAEQLEKSNNGHSGQTSKQAKQS 
VPTQTRATRRSNDPLADVEVAKSNILLIX3PTGVGKTYIAQAIiARVMNVPF 
V I TDATTLTEAG YVGDDVETVIiQRLIjQAADGDVSRAQHG I I Y IDE I DKI A 
RKSGENTSITRDVSGEGVQQALLKILEGTIASVPLEGTRKHKEQDVAQMD 
TRG I LF I CGGAF VGLTD I VRKRLGRRETGFGANWHDADMKDEELLEQVNA 
DDLAEFGLLPEFI GRLPVTSVLKELTVDDLTAI LTQPANALI KQYRKLFA 
VDGVDLQFTEQAIRAIADIAIKQGTGARGLRS 1 I ERTLQDTMFQLPSLDD 
VRQVIVDKASVEGSSTPKLLREAVDVPQGRLKVAQSVLLDRVRTHEA 

e) SEQ. ED. No. 74 (ORF219) 

MS EQLMEQ YRLRGQRKCRNAC I AAI VTWLVLAVAGGVWWTAGDGS ALVR 
NMFKPKATPATQPVWSTATFAYRTAPEFLAMEAGDRGTGNVNYSPASMW 
MALAIAAQGANGTTRSQLNELLGSGSLTDSDYQSLLSSINGQYSGAKSEM 
SAANSLWIDDDYSLASDYQSTVKKMFEAEVTTLPFDDQAAAKMSDWIAKH 
TNGSLKPKITLRDREVIiSIINTVYADGRWKDPFEEQSTGNGTFHGEAGDA 
QVPMMHQTFSQ^YGHDEYNTWQRVEIPFDNGGNLAIVLPAEGHFDELAG 
DAEKLSWAFGTCSTASLGEGAMGGAADSMPGWGVSVNSVMVNVTLPRFTI 
DSMFDSEATIKAFEKLGVTDAFSAGDADFTKMIDTGSHGENLYIGSILQG 
TRI EVNEAGAKAMSFTKVGADSVSAPVDNVEFTVDRPFLYS YVTPDGI PL 
F I GAVRNLGGVGGEN 
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f) SEQ. ED. No.576 (ORF1 972) 

MQIRPGSMYPLGASYDGAGVNFALYSQVAQKVELCLFDEHIDVETRIEMTE 
RNSYVWHNYI PGLHPGQRYGYRVYGPYDPVHGLRCNPNKLLLDPYAKAI E 
GNIDGDESLFSYWFKSPDDNSAMNDLDSAAHTMKSAVINPYFDWGNDQHP 
Y I S YHDS VI YEAHVRGMTNIjNMDVPPDI RGTYAGLAYPS VI E YLKKLGI T 
AIELMPIHQFVNDSFLQEKGLSNYWGYNTIGFFAPHNAYSSSGERGEQVN 
EFKSMVKAYHRAGMEVILDWYNHTAEGNHMGPTLSFKGIDNASYYRLVE 
GDQQHYFDTTGTGNS LLMRS PHALQL I TDS LR YWVTEMHVDGFRFDLAAT 
LARQFQEVDKLSAFFDIVEQDPIISRVKLIAEPWDLiGSGGYQVGGFPSSW 

SEWNGRYRDTVRDFWRSQPSTLPEFASRLMGSSDLYQVNGRRPVASVNFI 
TAHDGFTMNDLVS YNEKHNEANGEGNRDGESNNRSWNCGVEGPTNI PDVN 
DLRQRQMRNMFATLLFSQG I PMI CGGDEVARTQQGNNNAYCQDNEI SWTN 
WHLDKGRKE LLAF VS KL I HLRLDHPVLHRRRFFTGRE PGDDSNT I PQVEW 
FDHTGS I MDMDDWQNTHAFSMMI YLNGSD I PEVDWYGNRMVDNDFI L.I FN 
AHYEPIKFTLPDERYGRKWQLVVPTHNPNEPALSYEAGFMITAQSRSFLM 
LMSAKKPKKPMGL 

g) SEQ. ID. No.403 (ORF1403) 

MEIRARPHMASGSYRELFSARMMYGMQYIQQTIVGIDGSEARFFGYVADN 
SEEMEPDRIRPAILILPGGGYAMTSDREAEPVALQFLAKGFAVFVLRYSV 
QPSRYPVALLEAAEAMRLIRANVDQWHVNPAQVAVLGFSAGGHIoAANIAT 
S VGDED IREQGGMDPDAVRPNALMIiS YPVI TAGKYAHRGS FQCLLGDQAH 
NQALLDKFSIEKHIDAKTPPVFVTOiTMTDDAVPVENTLMLIQACRAAGVS 
IEAHLFPEGSHGLSLANAETAGNGFYAHIVECVQCWPDLAEAWLRRIiF 



65 



EP 1 227 152 A1 

(h) SEQ. ID. No.804 (ORF2676) 

MFLKPEQQLERCRRI VRQRVDPH IHPS I AQLTVES YD I PGE PMPSDEFFA 
KLDRGD I DFKPFMLGS EWGTTWGTVWFRLTGTVPAG Y PKGKPLEL I LDLG 
WYPHSCGGHIEGLVYRADGTAIKAVHPLNYWVPFMDAEGNAQVPVAEDGS 
FTLYLEAASNPLLLGVPPFIETEIiGDHATGKPDEPYVFKiSADLAEFDERY 
ENYSVDLDWSSLMEFADKQSPRYWQLAKAliQRSLNAYDEPJSTPESVEAAR 
AVLAGVLAKP ANAS AMNVS A I GHAHI DS AWLWPVRETRRKVARTVSNALA 
LMDADPDFKYAMSS AQQYAWLEEDHPDI FKRMKRRI EEGRF I PVGGMWVE 
ADGMLPAGESLIRQIAYGRKYFKEHLGVEPKGVWLPDSFGYTGAWPQIAR 
RAGYEWFLTQKISWNDTTKFPHHSFMWEGIDGSRIFTHFPPADTYAAWCK 
VQELDYAEKNFQDKDLSDRSLLLFGFGDGGGGPTRNMMEHLHRYENLEGV 
SKVSIEEPNDFFDKAHQQLAENAGPEMPVWKGELYLELHRGTLTSQQDMK 
RGCRQEESLLRTVEYLGAAAVLSDPEYVYPREELDRIWKTLLLNQFHDIL 
PGSAI AWVHREAREDYRRDLKRIJffilAQDMCAVI^KANPQADLIAEARI S 
QFRNDGASWHANRINEPTDALSVLTQTLDNGRVLLANGVLSVTIEADGTI 

SSLLDEEHGRELVPAGTRLGQYELLRDEPAVWDAWEIERESLLMANAVTG 
SIESVNTENGAAQVHVHTADGDTVITTTITLRPGSHTLDFHADIDWHERE 
RFLKVDLPLGIVADQATYDCQYGLIRRPIVKNTASDEAKYESSTNRFAII 
GDAGYAAAVINGSVYGSDASPIAGNAAEGRDSGTMFRLSLLSAPTFPDPR 
TDIGSHEFDWS WADATVDRALDAAGVLNAPVLHDVPD I TPLAS I ESVNG 
TVVLDWMKIiADDGSGDLIVRAYEAAGGQADAMLHVCPAIAGASVHETNVL 
EGDDLAADLPVALQDGRQNAEGATLHFGPFQLATLRITR 
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(i) SEQ. ID. No.3 1 3 (ORF1077) 

MI SRDGWAVIDDSAANI I IETDTVNGKANPFGTWVSPRATAETDLYFFGY 
GHRYI EAWDFYRLTGPTPLLPRFAMGNWWSRYYRYTQDGYIiALMDRFKR 
EGIPFTTSVIDMDWHRVDDVDPKYGSGWTGYSWNRELFPDPPAFLADLHR 
.RGLRTTI«NVHPRDGVRAFEDAYPEVAKRVGIDPATEENyEFDLTNPDFVt) 
AYFDMHHRMEAEGVDFWWLDWQQGGVTRQKGLDPLWMLNHMHYLDSGRGG 
NWPLTFSRYAGPGSHRYPVGFSGDTIVTWESLAFQPQFTATASNIGYGWW 
SHDIGGHMFGYRNEELEARWYQLGAFSPINRLHSSNSPFSGKEPWNFNRD 
VSAAMVDALRLRHAl^PYLYTMNYRAAEAGRPLVEPMYWQNPDTPDAYEV 
PDEFRFGTELWAFIVSPbDAAACRGRADAWLPQGEWFDFFDGRRYVSSD 
AAGRRLE VWRS LDRT P VFAKAGA I VP LQD VAE S GEAI NS I ANPQAX.RVL1V 
F PGADGS FVMREDRGTWGAPS ADTAI AFTWGGADAS P SAFTVAP VTGDTS 
AVPELRDWTWFRGVAPVDAASGVRAWSGEAPVEATVAYDEATMSLTVSV 
TGI SSAASLRIEI PGGLRIADNPVESDAMDLLLHAQMLYRTKELALQAVH 
KLGIGAIGALRTMNRGPRYANDFWITDMPDAVAGALEE I LLRS 

5. A polynucleotide encoding a fusion protein, comprising any one of SEQ. ID. NO. 2 to SEQ. ID. NO. 1 098 ligated 
in frame to a polynucleotide encoding a heterologous polypeptide. 

6. A recombinant vector that contains the polynucleotide of any of the preceding claims. 

7. A recombinant vector according to claim 6, that contains a polynucleotide of Claim 4. 

B. A recombinant vector according to any of the claims 6 or 7, wherein the polynucleotide is operatively associated 
with a regulatory sequence that controls gene expression. 

9. A host that contains a polynucleotide according to any of the Claim 1 to 5 or a recombinant vector according to 
any of the claims 6 to 8. . 

10. A host according to claim 9, that contains a polynucleotide of Claim 4 or the vector according to claim 7. 

11 . A host according to any of the claims 9 or 1 0, which is a procaryotic cell, an eucaryotic or a plant or a a non-human 
animal. 

12. A method for producing a polypeptide of Bifidobacteruim longum, comprising: 

(a) culturing a host cell according to claim 9 or claim 10 under conditions suitable to produce the polypeptide 
encoded by the. polynucleotide; and 

(b) recovering the polypeptide from the culture. 

13. A method for producing a fusion protein, comprising: 

(a) culturing a host cell according to Claim 9 or claim 10 under conditions suitable to produce the fusion protein 
encoded by the polynucleotide; and ■ < ' 
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(b) recovering the fusion protein from the culture. 

14. A polypeptide encoded by a polynucleotide according to any of the claims 3 to 5. 

15. A fusion protein encoded by a polynucleotide of Claim 5. 

16. An antibody that immunospecifically binds to a polypeptide of any of the claims 14 or 15. 

17. An antibody that immunospecrfically binds to the fusion protein of claim 5. 

18. A method for the detection and/or identification of Bifidobacterium longum in. a biological sample, comprising: 



(a) contacting the sample with a polynucleotide primer derived fom a sequence according to any of the claims 
1 to 4 in the presence of a polymerase enzyme and nucleotides under conditions which permit primer extension- 

is and 

(b) detecting the presence of primerextension products in the sample in which the detection of primer extension 
products indicates the presence of Bifidobacteruim longum in the sample. 

19. A method for the detection and/or identification of Bifidobacterium in a biological sample, comprising: 

(a) contacting the sample with a probe derived from a polynucleotide according to any of the claims 1 to 8 
under conditions which permit hybridization of complementary base pairs; and 

(b) detecting the presence of hybridization complexes in the sample in which the detection of hybridization 
complexes indicates the presence of Brfodobacterium longum in the sample. 

20. A method for the detection and/or identification of Bifodobacterium in a biological sample, comprising: 

(a) contacting the sample with an antibody according to Claim 1 6 or 1 7 under conditions suitable for the for- 
mation of immune complexes; and 

(b) detecting the presence of immune complexes in the sample, in which the detection of immune complexes 
indicates the presence of Bifodobacterium longum in the sample. 

21. A method for the detection and/or Identification of antibodies to Bifodobacterium longum in a bloloqical sample 
comprising: K ' 

(a) contacting the sample with a polypeptide according to any of the claims 14 or 1 5 under conditions.suitable 
for the formation of immune complexes; and 

(b) detecting the presence of immune complexes in the sample, in which the detection of Immune complexes 
indicates the presence of Bifodobacterium longum in the sample 

40 

22. A DNA array/chip containing an array of polynucleotides comprising at least a polynucleotide according to any of 
the claims 1 to 4 or a vector according to any of the claims 6 to 8. 

23. A protein array/chip containing an array of polypeptides comprising at least one of the polypeptides according to 
45 any of the claims 14 or 15. 

24. An immunogenic composition comprising a polypeptide ccording to any of the claims 14 or 15 and a pharmaceu- 
tically acceptable earner. 

so 25. An immunogeneic composition comprising a polypeptide according to any of the claims 14 or 15 and a pharma- 
ceutical ly acceptable carrier. 



26. A screening assay, comprising: 

(a) contacting a testcompoi 
to any of the claims 6 to 8; , 

(b) detecting whether binding occurs. 



55 (a) contacting a test compound with a polynucleotide according to any of the claims 1 to 4 or a vector according 

to any of the claims 6 to 8; and 
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27. A screening assay, comprising: 

(a) contacting a test compound with a polypeptide according to any of the claims 14 or 15; and 

(b) detecting whether binding occurs. 

28. A kit comprising a container containing a polynucleotide according to any of the claims 1 to 4. 

29. The kit according to claim 28, wherein the polynucleotide is a primer or a probe and wherein the kit optionally 
contains a polymerase and deoxynucleotide triphosphates. 

30. A. kit comprising a container containing an antibody that immunospeciflcally binds to a polypeptide according to 
any of. the claims 14 and 15: 

31. A computer readable medium having recorded thereon a nucleic acid sequence according to any of the claims 1 
15 to 8. 

32. A computer readable medium having recorded thereon a polypeptide sequence according to any of the claims 1 4 
or 15. 

so 33. The computer readable medium according to any of the claims 31 or 33, wherein said medium is selected from 
the group consisting of: 

(a) a floppy disc; 

(b) a hard disc; 

25 .(c) random access memory (RAM); 

(d) read only memory (ROM); and 

(e) CD-ROM. 

34. A computer-based system for identifying fragments of the Bifidobacterium longum genome comprising the following 
30 elements: 

(a) a data storage means comprising a nucleic acid sequence according to any of the claims 1 to 4; 

(b) search means for comparing a target sequence to the nucleotide sequence of the data storage means of 
step (a) to Identify homologous sequence (s); and 

35 (c) retrieval means (or obtaining said homologous sequence (s) of step (b). 

35. Lactic acid bacterium, which Is Bifidobacterium NCC2705(CNCM 1-2618).. 

36. Use of the Bifidobacterium according to claim 35, or a culture supernatant thereof, or a metabolite thereof for the 
40 preparation of a carrier. 

37. The use according to claim 36, wherein the Bifidobacterium is contained in the earner in an amount of from about 
10 s cfu / g to about 10 12 cfu / g earner material. 

38. The use according to any of the claims 36 and 37, wherein the carrier is for the prevention and/or treatment of 
diearrohea brought about by pathogenic bacteria and/or rotaviruses. 

39. The use according to any of the claims 36 to 38, wherein the earner is a food composition selected from milk, 
yogurt, curd, cheese, fermented milks, milk based fermented products, ice-creams, fermented cereal based prod- 
so ucts, milk based powders, infant formulae, pet food or a pharmaceutical composition selected from tablets, liquid 

bacterial suspensions, dried oral supplement, wet oral supplement, dry tube feeding or wet tubefeeding. 
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The Search Division considers that the present European patent application does not comply with the 
requirements of unity o! invention and relates to several inventions or groups ot inventions, namely: 




Invention 1 (Claims 1,6,8,9,11-13,18,19,22,26,28, 
29 partly) 



-A polynucleotide having a nucleotide sequence of the 
Bifidobacterium, comprising the nucleotide sequence of SEQ 
10. NO. : 1, a nucleotide sequence exhibiting at least 90X 
Identity with the SEQ. ID. M0.: 1 or a nucleotide sequence 
that hybridises to SEQ. ID. NO. : 1 under stringent 
conditions, a recombinant vector that contains a 
polynucleotide as above, a host that contains polynucleotide 
as above or a vector as above, a method of producing a 
polypeptide of Bifidobacterium longum and a fusion 
polypeptide thereof, a method of detecting and/or 
identifying Bifidobacterium in biological sample with help 
of the polynucleotide as above, a DNA array/chip containing 
an array of polynucleotides comprising a polynucleotide as 
above, a screening assay for a compound binding to 
polynucleotide as above and a kit containing a nucleotide as 
above. 



Invention 2 (Claims 2,6,8,9,11-13,18,19,22,26,28, 
29 partly) 

-A polynucleotide having a nucleotide sequence selected from 
the group: a nucleotide sequence as identified by SEQ. ID. 
NO. 1099 or a nucleotide sequence exhibiting at least 95X 
Identity with SEQ. ID. NO. : 1099 or polynucleotide which 
hybridises to SEQ. ID. NO. : 1099 under conditions of high 
stringency, a recombinant vector that contains a 
polynucleotide as above, a host that contains a 
polynucleotide as above or a vector as above, a method of 
producing a polypeptide of Bifidobacterium longum and a 
fusion protein thereof, a method of detecting or identifying 
Bifidobacterium In biological samples with help of 
polynucleotides as above, a DNA array/chip containing an 
array pf polynucleotides as above and a kit containing a 
nucleotide as above. 

Invention 3 (Claims, 3-6,8,9,11-30 partly) 

-A polynucleotide having a nucleotide sequence of an open 
reading frame of Bifidobacterium longum genom comprising 
the nucleotide sequence SEQ. ID. NO. : 2, a nucleotide 
sequence exhibiting at least 95* identity with that encoding 
SEQ. ID. NO. : 2, a nucleotide sequence that hybridises to 
that encoding SEQ. ID. NO.: 2 under stringent conditions, a 
polynucleotide encoding a fusion protein of that of SEQ. ID. 
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The Search Division considers that the present European patent application does not comply with the 
requirements of unity of invention and relates to several inventions or groups of Inventions, namely. 

NO.: 2 and a heterologous polypeptide, a corresponding 
recombinant vectors and hosts , a method of producing a 
polypeptide of SEQ ID HO. : 2 and said fusion polypeptide, 
polypeptides and fusion polypeptides characterised by SEQ. 
ID. N0.:2, corresponding inraunospeclflc antibodies, a method 
of detection and/or identification of Bifidobacterium 1n 
biological samples based on said nucleic adds, polypeptides 
and/or antibodies, a DMA array/chip containing an array of 
polynucleotides comprising a polynucleotide encoding the 
SEQ. ID. NO. :2, having at least 90% Identity to that 
encoding SEQ. ID. NO. : 2 or hybridising under high 
stringency with such a sequence or comprising corresponding 
vector, a protein chip/array containing an array of 
polypeptides comprising that of SEQ. ID. NO.: 2, an 
Immunogenic composition comprising a polypeptide of SEQ. ID. 
NO. :2, a screening assay for compounds binding to the 
polypeptide of SEQ. ID. NO. : 2 or the corresponding 
polynucleotides, a kit containing a corresponding 
polynucleotide or a specific antibody. 

Inventions 4 to 1091 (Claims 3-6,8,9,11-30 partly) 

-Idem as subject three but in reference ot SEQ. ID. NO.: 3, 
4, 6 to 73, 75 to 312, 314 to 402, 404 to 423, 425 to 524, 
526 to 547, 549 to 575, 577 to 803, 805 to 1098. 

Inventions 1092 to 1100 (Claims 4,6-14,16, 
18-30 partly) 



-Idem as subject 4 but In reference to SEQ. ID. N0.:5. 74, 
313, 403, 424, 525, 548, 576, 804. 

Invention 1101 (Claims 31 to 34) 

-Computer readabl e ned1 um and computer-based system for 
identifying fragments of the Bifidobacterium longua genome. 

It should be noted that the above claims have as a subject 
matter computer readable medium and a computer based system 
which appear to constitute merely ways of presenting 
information and as such would not be regarded as patentable 
inventions In the sense of Art. 52.1 EPC. Subject matter 
excluded from patentability under Art. 52.1 EPC needs not be 
a subject of the search. 

Invention 1102 (Claims 35 to 39) 

-A strain of lactic acid bacterium and the uses thereof. 
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The Search Division considers that the present European palenl application does not comply with the 
requirements ot unity of invention and relates to several inventions or groups of inventions, namely: 

The application has been divided Into the above (groups of) inventions 
which individually are considered to meet the requirement of unity. If 
additional fees are paid for (one or more of) the, as yet, unsearched 
invention(s), the subsequent search(es) might reveal prior art which 
leads to a finding of lack of unity a posteriori within (one or more of) 
the, as yet, unsearched inventlon(s). Should this be the case, as a 
rule, no further invitation to pay additional fees will be issued. Only 
the first Identified invention in each group of inventions, for which 
additional search fees have been paid in due time and which subsequently 
is considered to lack unity, will be searched. 
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